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TABLE 7 



SEQ 
ID 
NO: 


Predicted 
beginning 

location 
corresponding 

residue of 
sequence 


Predicted end 

corresponding 

residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine; M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V= Valine, 
W=Trypfccphan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 


5359 


337 


1131 


AKLSARLSALILDEVAILPAPQNLSVLSTNMKHLLMWSPVIAPG 
ETVYYSVEYQGEYESLYTSHIWIPSSWCSLTEGPECDVTDDITA 
TVPYHLRVRATLGSQTS/CLEHP/VSIPLIETQPSLPDL/RMEI 
TKDGFHLVIELEDLGPQFEFLVAYWRREPGAEEHVKMVRSGGIP 
VHLETM3PGAAYCVKAQTFVKAIGRYSAFSQTECVEVQGEAIPL 
VLALFAFVGFMLILWVPLFVWKMGRLLQ/YLLLPRGGSSQTPW 
KITQF 


53G0 


2 


1115 


PRVRSSGGQEDPASQQWARPRFTQPSKMRRRVIARPVGSSVRLK 
CVASGHPRPDITWMKDDQALTRPEAAEPRKKKWTLSLKNLRPED 
SGKYTCRVSNRAGAINATYKVDVIORTRSKPVLTGTHPVNTTVD 
FGGTTSFQCKVRSDVKPVIQWLKRVEYGAEGRHNSTIDVGGQKF 
WLPTGDVWSRPDGSYLHKLLITRARQDDAGMYICLGANTMGYS 
FRSAFLTVLPDPKPPGPPVA3SSSATSLPWPWIGIPAGAVFIL 
GTLLLWLCQAQKKPCTPAPAPPLPGHRPPGTARDRSGDKDLPSL 
AALSAGPGVGLCEEHGSPAAPQHLLGPGPVAGPKLYPKLYTGHS 
TPIITYTHPPPSCQLNSSHS 


53 61. 


3 


925 


HEGS I S SANILLDDQFQPKLTDFAMAHFRSHLEHQSCT1MMTSS 
SSKHLWYMPBEYIRQGKLSIKTDVYSFGIVIMEVLTGCRVVJjDD 
PKHIQLRDLLRELMEKRGLDSCLSFLDKKVPPCPRNFSAKLFCL 
AGRCAATRAKLRPSMDEVLNTLESTQASLYFAEDPPTSLKSFRC 
PSPLFLENVPSIPVEDDESQlslNNLLPSDEGLRIDRMTQKTPFEC 
SQSEVMFLSLDKKPESKRNEEACNMPSS8CEESWFPKYIVPSQD 
LRPYKVNIDPSSEAPGHSCRSRPVE I EQY1 


5362 


2 




SCQVSGCTRTYNSSQSIGKHMKTAHPDQYAAFKMQRKSKKGQKA 
NNLNTPHN3KFVYFLPSPVNSSNPFFTSQTKANGKPACSAQLQH 
VSPPIFPAHLASVSTPLLSSMKSVINPNITSQDKNEQGGMLCSQ 
MENLPSTALPAQMEDLTKTVLPLNIDRG3DPFLSLPAESSSIDL 
FPSPADSGTNSVFSQLENNTNHYSSQIEGNTNSSFLKGGNGENA 
VFPSQVNVANNFSSTNAQQSAPEKVKKDRGRGQTCKERKPKHNK 
RAKWPAIIRDGKFICSRCYRAFTNPRSLGGHLSKRSYCKPLDGA 
EIAQELLQSNGQPSLLASMILSTNAVNLQQPQQSTFNPEACFKD 
PSFLQLLAENRSPAFLPNTFPRSGVTNFNTSVSQEGSEIIIQAL 
ETAGIPSTFEGAEM1SHVSTGCVSDASQVNATVMPNPTVPPLLH 
TVCHPNTLLTNQNRTSNSKTSSIEECSSLPVFPTIIDLLLKTVEN 
GLCSSSFPNSGGPSQNFTSNSERVSVISGPQNTRSSHLNKKGNS 
ASKRRKKVAPPLIAPNASQNLVTSDLTTMGLIAKSVEIPTTNLH 
SNVIPTCEPQSLVENLTQKLWNVNNQLFMTDVKEKFKTSLESHT 
VLAPLTLKTENGDSQMMALHSCTTSVNSDLQ1SEDNVIQNFEKT 
LEIIKTAMNSQILEVKSGSQGAGETSQNAQINYNIQLPSVNTVQ 
NNKLPDSSP\FSSFISVMPTESNIPQSE\VSHKEDQIQEILEGL 
QKLKLENDLSTPASQCVLINTSVTLTPTPVKSTADITVIQPVSE 
MINIQFNDKVNKPFVCQNQGCNYSAMTKDALFKHYGKIHQYTPE 
MIL3IKKNQLKFAPFKCWPTCTKTFTRNSNLRAHCQLVHEFTT 
EEMVKLKIXRPYGRKSQSENVPASRSTQVKKQLAMTEENKKESQ 
PALELRAETQNTHSNVAVIPEKQLIEKKSPDKTESSLQVITVTS 
EQCNTNALTNTQTKGRKIRRHKKEKEEKKRKKPVSQSLEFPTRY 
SPYRPYRCVHQGCFAAFTIQQNLILHYQAVHKSDLPAFSAEVEE 
ESEAGKESEETETKQTLKEFRCQVSDCSRIFQAITGLIQHYMKL 
HEMTPEEIESMTASVDVGKFPCDQLECKSSFTTYLNYWHLEAD 
HGIGLRASKTEEDGVYKCDCEGCDRIYATRSNLLRHIFMKHNDK 
HKAHLIRPRRLTPGQENMSSKANQEKSKSKHRGTKHSRCGKEGI 
KMPKTKRKKKNNLENKNAKI VQ IEENKPYS LKRGKHVYS IKARN 
DALSECTSRFVTQYPCMIKGCTSWTSESNIIRHYKCHKLSKAF 
TSQHRNLLIVFKRCCNSQVKETSEQEGAKNDVKDSDTCVSESND 
tTSRTTATVSQKEVEKWE*DEMDELTELFITKLINEDSTSVETQA 
NTSSNVSNDFQEDNLCQSERQKASNLKRVNKEKNVSQNKKRKVE 
KAEPASAAELSSVRKEEETAVAIQTIEEHPASFDWSSFKPMGFE 
VSFLKFLEESAVKQKKNTDRDHPNTGNKKGSHSNSRKNIDKTAV 
TSGNHVCPCKESETFVQFANPSQLQCSDNVKIVLDKNLKDCTEL 
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SEQ 
ID 


Predicted 
beginning 

location 
corresponding 
to first 
amino acid 

sequence 


Predicted end 
nucleotide 

corresponding 
to first 
amino acid 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








VLKQLQEMKPTVSLKKLEVHSNDPDMSVMKDISIGKATQRGQY 


5363 


8065 


703 


RLCCTGGGEGTPGASGKRGPAATTSLVLCIPSVPPPVPFPTLWP 
PPSWRRQPPGGIRRDFSRRLRREANLVATCLPVRASLPHRLNML 
RGPGPGLLLLAVLCLGTAVPSTGASKSKRQAQQMVQPQSPVAVS 
QSKPGCYDNGKHYQINQQWERTYLGNALVCTCYGGSRGFNCESK 
PEAEETCFDKYT3NTYRVGDTYERPKDSMIWDCTCIGAGRGRIS 
CTIANRCHEGGQSYKIGDTWRRPHETGGYMLECVCLGNGKGEWT 
CKPIAEKCFDHAAGTSYWGETWEKPYCGWMMVDCTCLGEGSGR 
ITCTSRNRCNDQDTRTSYRIGDTWSKKDNRGNLliQCICTGNGRG 
EWKCERHTSVQTTSSGSGPFTDVRAAVYQPQPHPQPPPYGHCVT 
DSGWYSVGMQIA*KTQGNKQML\CTCLGNGVSCQETAVTQTYG 
GNSNGEPCVLPFTYNGRTFYSCTTEGRQDGHLWCSTTSIJYEQDQ 
KYS FCTDHTVLVQTRGGNSNGALCHFPFLYNNHNYTDCTS EGRR 
DNMKWCGTTQNYnADQKFGFCPMAAHEEICTTNEGVMYRIGDQW 
DKQHDMGHMMRCTCVGNGRGEWTCIAYSQLRDQCIVDDITYNVN 
DTFHKRHEEGHMLNCTCFGQGRGRWKGDPVDQCQDSETGTFYQI 
GDSWEKYVHGVRYQCYCYGRGIGEWHCCPLOTYPSSSGPVEVFI 
TETPSQPNSHPIQWNAPQPSHISKYILRWRPKNSVGRWKEATIP 
GHLNSYTIKGLKPGWYEGQLISIQQYGHQEVTRFDFTTTSTST 
PVTSNTWTGETTEFSPLVATSESVTEITASSFWSWVSASDTV 
SGFRVEYELSEEGDEPQYLVLPSTATSV\NIP\DLLPGRKY1VN 
VYQISEDGEQSLILSTSQTTAPDAPPDPTVDQVDDTSIWRWSR 
PQAPIT3YRIVYSFSVEGSSTELNLPETANSVTLSDLQPGVQYN 
I T Z YAVE EKC'ESTPWI QQETTGTPRSDTVPS PRDLQFVEVTDV 
KVTIMWTPPESAVTGYRVDVIPVNLPGEHGQR1.PLSRNTF\AEK 
TGIiSPGVTYYFKVFAVSHGRESKPLTACQTTKL\DAPTIILQFVN 
ETDSTVXjVRWTPPRAQITGYRLTVGLTRRGQPRQYNVGPSVSKY 
PLRNLQPASEYTVSLVAIKGNQESPKATGVFTTLQPGSSIPPYN 
TEVTETTIVITWTPAPRIGFKLGVRP3QGGEAPREVTSDSGS IV 
VSGLTPGVEYVYTIQVLRDGQERDAP\IVNK\WTPLSPPTNLH 
LEANPDTGVLTVSftERSTTPDITGYRITTTPTNGQQGNSLEEW 
HADQSSCTF\DNLEVPGLEYNVSVYTVKDDKESVPISDTIIPAV 
PPPTDLRFTN/ILGPDTMRVTW\APPPSIDLTNFLVRYSPVKNE 
GRMLQSLSIFFLBDN\AVVLTNLLPGTEYWSVSSVYEQHBSTP 
\LRGRQKTGLDSP\TGIDFS\DITA\NSFT\VHW\IAPRA/TPI 
TGYRIR\HHPEHF\SGRPREDR\VPH3RNSITLTNLTPGTEYW 
EIVALNGREESPLLIGQQSTVSDVPRDLEWAATPTSLLISSWD 
APAVTVRYYRITYGETGGNSPVQEFTVPGSKSTATISGLKPGVD 
YTITVYAVTGRGDS PAS S KPIS INYRTE IDKPSQMQVTDVQDNS 
ISVKWLPSSSPVTGYRVTTT\PKNGPG\PTKTKTAGPDQT3MTI 
EGLQPTVEYWSVYAQNPSGESQPLVQTAVTNIDRPKGLAFTDV 
DVDSIKIAWESPQGQVSRYRVTYSSPEDGIHELFPAPDGE3DTA 
ELQGLRPGSEYTVSWALHDDMESQPLIGTQSTAIPAPTDLKFT 
QVTPTSLSAQWTPPNVQLTGYRVRVTPKEKTGPMKEINLAPDSS 
SVWSGLMVATKYEVSVYALKDTLTSRPAQGWTTLENVSPPRR 
ARVTDATETTITISWRTKTETITGFQVDAVPAWGQTPIQRTIKP 
DVRSYTITGLQPGTDYKIYLYTLNDWARSSPWIDASTAIDAPS 
NLRFLATTPNSLLVSWQPPRARITGYIIKYEKPGSPPREWPRP 

QLVTLPHPNLHGPEILDVPSTVQKTPFVTHPGYDTGNGIQLPGT 
SGQQPSVGQQMIFEEHGFRRTTPPTTATPIRHRPRPYPPNVGQE 
ALSQTTISWAPFQDTSEYIISCHPVGTDEEPLQFRVPGTSTSAT 
LTGLTRGATYNIIVEALKDQQRHKVREEWTVGNSVWEGLNQPT 
DDSCFDPYTVSHYAVGDEWERMSESG7KLLCQCLGFGSGHFRCD 
SSRWCHDNGVNYKIGEKWDRQGEKGQMMSCTCLGNGKGEFKCDP 
HEATCYDDGKTYHVGEQWQKEYLGAICSCTCFGGQRGWRCDNCR 
RPGGEPSPEGTTGQSYNQYSQRYHQRTNTNVNCPIECFMPLDVQ 
ADREDSRE 


5354 


8066 


703 


RLCCTGGGEGTPGASGKRGPAATTSLVLCIPSVPPPVPFPTLWP 
PPSWRRQPPGGIRRDFSRRLRREANLVATCLPVRASLPHRLNML 
RGPGPGLLLIAVLCLGTAVPSTGASKSKRQAQQMVQPQSPVAVS 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 
amino acid 

sequence 


Predicted end 
nucleotide 

corresponding 
to first 
amino acid 
residue of 
airdno acid 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glut amine, R=Arginlne, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








QSKPC-CYDNGKHYQINQQWERTYLGNALVCTCYGGSRGFNCESK 
PEAEETCFDKYT3NTYRVGDTYERPKDSMIWDCTCIGAGRGRIS 
CTIANRCHEGGQSYKIGDTWRRPHETGGYMLECVCLGNGKGEWT 
CKPIAEKCFDHAAGTSYWGETWEKPYQGWMMVDCTCLGEGSGR 
ITCTSRNRCNDQDTRTSYRIGDTWSKKDNRGNLLQCICTGNGRG 
EWKCERHTSVQTTSSGSGPFTDVRAAVYQPQPHPQPPPYGHCVT 
DSGWYSVGMQLA*KTQGNKQML\CTCLGNGVSCQETAVTQTYG 
GNSNGEPCVLPFTYNGRTFYSCTTEGRQDGHLWCSTTSNYEQDQ 
KYSFCTDHTVLVQTRGGNSNGALCHFPFLYNNHNYTDCTSEGRR 
DNMKWCGTTQNYDADQKFGFCPMAAH3EICTTIJEGVMYRIGDQW 
DKQHDMGHMMRCTCVGNGRGEWTCIAYSQLRDQCIVDDTTYNVN 
DTFHKRHEEGHMLNCTCFGQGRGRWKCDPVDQCQDSETGTFYQI 
GDSWEKYVHGVRYQCYCYGR31GEWHCQPLQTYPSSSGPVEVFI 
TETPSQPNSHP1QWNAPQPSH1SKYILRWRPK1JSVGRWKEATIP. 
GHLNS YT I KGLKPGWYEGQLI S I QQYGHQEVTRFDFTTTSTST 
PVTSNT\VTGETTPFS PLVATSES VTE ITASS FWSWVSASDTV 
SGFRVEYELSEEGDEPQYLVLPSTATSV\NIP\DjLPGRKYIVN 
VYQISEDGEQSLILSTSQTTAPDAPPDPTVDQVDDTSIWRWSR 
PQAPITGYRIVYSPSVEGSSTELNLPETANSVTLSDLQPGVQYN 
ITIYAVEENQESTPWIQQETTGTPRSDTVPSPRDLQFVEVTDV 
KVTIMWTFPESAVTGYRVDVI PVNLPGEHGQRLPLSRNTF \AEN 
TGLSPGVTYYFKVFAVSHGRESKPLTAQQTTKL\DAPTNLQFVN 
ETDSTVLVRWTPPRAQITGYRLTVGLTRRGQPRQYNVGPSVSKY 
PLRNLQPASEYTVSLVAIKGNQESPKATGVFTTLQPGSSIPPYN 
TEVTETTIVITWTPAERIGFKLGVRPSQ3GEAPREVTSDSGSIV 
VSGLTPGVEYVYTIQVLRDGQERDAP\IVNK\WTPLSPPTNLH 
LEANPDTGVLTVSWERSTTP3ITGYRITTTPTNGQQGNSLEEW 
HADQSS CTF\DNLEVPGLE YNVS VYTVKDDKES VP ISDTIIPAV 
PPPTDLRFTN/1LGPDTMRVTW\APPPSIDLTNFLVRYSPVKNE 
SRMLQSrjSIFFLSDN\AWLTNLLPGTEYWSVSSVYEQHESrP 
\LRGRQKTGLDSP\TGIDFS\DITA\WSFT\VHW\IAPRA/TPI 
TC-YRIR\HHPEHF\SGRPREDR\VPHSRNSITLTHLTPGTEYW 
SIVALNGREESPLLIGQQSTVSDVPRDLEWAATPTSLLI\SWD 
APAVT VR Y YR I T YG ETGGNE P VQE FTVPGS KS TATI S QLKPG VD 
YT ITVYAVTGR GDSPASSKPISINYRTEI DKPS QMQVTDVQDNS 
ISVKWL?SSSPVTGYRVTTT\PKNGPG\PTKTKTAGPDQTEMTI 
EGLQPTVEYWSVYAQNPSGESQPLVQTAVTNIDRPKGLAFTDV 
DVDSIKIAWESPQGQVSRYRVTYSSPEDGIHELFPAPDGEEDTA 
ELQGLRPGSEYTVSWALHDDMESQPLIGTQSTAIPAPTDLKFT 
QVTPTSLSAQWTPPNVQLTGYRVRVTPK3KTGPMKEINLAPDSS 
SWVSGLMVATKYEVSVYALKDTLTSRPAQGWTTLENVSPPRR 
ARVTDATETTITI SWRTKTETITGFQVDAVPANGQTP IQRTI KP 
DVRSYTITGLQPGTDYKIYLYTLNDNARSSPVVIDASTAIDAPS 
NLRFLATTPNSLLVSWQPPRARITGYIIKYEKPGSPPREWPRP 
RPGVTEATITGLEPGTEYTIYVIALKNNQKSEPLIGRKKTDELP 
QLVTLPHPNLHGPEILDVPSTVQKTPFVTHPGYDTGNGIQLPGT 
SGQQP SVGQQM I FEEHGFRRTTPPTTATP IRHRPRP YPPNVGQE 
ALSQTT I SWAP FQDTSEY1 1 SCHPVGTDEEPLQFRVPGTSTSAT 
LTGLTRGATYNIIVEALKDQQRHKVREEWTVGNSVNEGLNQPT 
DDSCFDPYTVSHYAVGDEWERMSESGFKLLCQCLGFGSGHFRCD 
S3RWCHDNGVNYKIGEKWDRQGENGQMMSCTCLGNGKGEFKCDP 
HEATCYDDGKTYHVGEQWQKEYLGAICSCTCFGGQRGWRCDNCR 
RPGGEPSPEGTTGQSYNQYSQRYHQRTNTNVNCPIECFMPLDVQ 
ADREDSRE 


53S5 


8056 


703 


RLCCTGGG3GTPGASGKRGPAATTSLVLCIPSVPPPVPFPTLWP 
PPSWRRQPPGGIRRDFSRRLRREANLVATCLPVRASLPHRLNML 
RGPGPGLLLLAVLCLGTAVPSTGASKSKRQAQQMVQPQSPVAVS 
QSKPGCYDNGKHYQINQQWERTYLGNALVCTCYGGSRGFNCESK 
PEAEETCFDKYTGNTYRVGDTYERPKDSMIWDCTCIGAGRGRIS 
CTIAWRCHEGGQSYKIGDTWRRPHETGGYMLECVCLGNGKGEWT 
CXPIAEKCFDHAAGTSYWGETWEKPYQGWMMVDCTCLGEGSGR | 
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SEQ 
ID 
MO: 


Predicted 

nucleotide 

corresponding 
to first 
amino acid 
residue of 

sequence 


nucleotide 

corresponding 
to first 

residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=3fcop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








I TCTSRNECNDQDTKTS YR IGDTWSfCKDNRGNLLQCI CTGNGRG 
EWKCERHTSVQTTSSGSGPFTDVRAAVYQPQPHPQPPPYGHCVT 
DSGWYSVGMQLA*KTQGNKQML\CTCLGWGVSCQETAVTQTYG 
GNSNGEPCVLPFTYNGRTFYSCTTEGRQDGHLWCSTTSNYEQ.DQ 
KYSFCTDHTVLVQTRGGNSNGALCHFPFLYNNHNYTDCTSEGRR 
DNMKWCGTTQNYDADQKFGFCPMAAHEEICTTNE3VMYR1GDQW 
DKQHDMGHMMRCTCVGNGRGEWTCIAYSQLRDQCIVDDITYNVN 
DTFHKRHEEGHMLNCTCFGQGRGRWKCDPVDQCQDSETGTFYQI 
GDSWEKYVHGVRYQCYCYGRGIGEWHCCPLQTYPSSSGPVEVFI 
TETPSQPNSHPIQWNAPQPSHISKYILRWRPKNSVGRWKEATIP 
GHLNSYTIKGLKPGWYEGQLISIQQYGHQEVTRFDFTTTSTST 
PVTSNT WTGETTPFS PLVATS ES VTE ITAS SF WSWVS ASDTV 
SGFRVEYELSEEGDEPQYLVLPSTATSV\NIP\DLLPGRKYIVW 
VYQISEDGEQSLILSTSQTTAPDAPPDPTVDQVDDTSIWRWSR 
PQAPIT3YRIVYSPSVEGSSTELHLPETANSVTL3DLQPGVQYN 
ITIYAVEENQESTPWIQQErTGTPRSDTVPSPRDLQFVEVTDV 
KVTIMWTEPESAVTGYRVDVIPWLPGEHGQRLPLSRNTF\AEN 
TQLSPGVTYYFKVFAVSHGRESKPLTAQQTTKL\DAPTNLQFVN 
ETDSTVLVRWT P PRAQ I TG YRLTVGLTRRGQPRQYNVGP S VSKY 
PLRNLQPASEYTVSLVAIKGNQESPKATGVFTTLQPGSSIPPYN 
TEVTETTIVITWTPAPRIGFKLGVRPSQGGEAPREVTSDSGSIV 
VSGLTPGVEYVYTI QVLRDGQERDAP \ I VNK\ WTPLSPPTNLH 
LEANPDTGVLTVSWERSTTPDITGYRITTTPTNGQQGNSLEEW 
HADQSSCTF\DNI.,F.VPGLEYNVSVYTVKDDKESVPISDTIIPAV 
PPPTDLRFTN/ ILGPDTMRVTW\APPPSIDLTNFLVRYSPVKNE 
3KKLQSLSiFFL3DN\AVVLTNLLPGTEYWSV8SVYEQHESTP 
\LRGRQKTGLDSP\rGIDFSVDITA\lISFT\VHW\IAPRA/TPI 
TGYRIR\HHPEHF\SGRPREDR\VPHSRNSITLTWLTPGTEYVV 
S1VAL,NGREESPLLIGQQSTVSDVPRDLEWAATPTSIjLI\SWD 
APAVTVRYYRITYGETGGNSPVQEFTVPGSKSTATISGLKPGVD 
YT I T V YAVTGRGDS PAS S KP I S IN YRXE I DKPS QMQ VTE VQDNS 
ISVKWLPSSSPVTGYRVTTT\PKNGPG\PTKTKTAGPDQTEMTI 
EGLQPTVEYWSVYAQNPSGESQPLVQTAVTNIDRPKGLAFTDV 
DVDSIKIAWESPQGQVSRYRVTYSSPEDGIHELFPAPDGEEDTA 
ELQGLRPGSEYTVSWALHDDMESQPLIGTQSTArPAPTDLKFT 
QVTPTSLSAQWTPPNVQLTGYRVRVTPKEKTGFMKEINLAPDSS 
SVWSGLMVATKYEVSVYALKDTLTSRPAOGWTTLENVSPPRR 
ARVTDATETTITISWRTKTETITGFQVDAVPANGQTPIQRTIKP 
DVRSYTITGLQPGTDYKIYLYTLNDNARSSPWIDASTAIDAPS 
NLRFLATTPNSLLVSWQPPRARITGYIIKYEKPGSPPREWPRP 
RPGVTEATITGLEPGTEYTIYVIALKNNQKSEPLIGRKKTDELP 
QLVTLPHPMLHGPEILDVPSTVQKTPFVTHPGYDTGNGIQLPGT 
SGQQFSVGQQMIFEEHGFRRTTPPTTATPIRHRPRPYPPNVGQE 
ALSQTTISWAPFQDTSEYIISCHPVGTDEEPLQFRVPGTSTSAT 
LTGLTRGATYNIIVEALKDQQRHKVREEWTVGNSVHEGLNQPT 
EDSCFDPYTVSHYAVGDEWERMSESGFKLLCQCLGFGSGHFRCD 
SSRMCHDNGVNYKIGEKWDRQGENGQMMSCTCLGNGKGEFKCDP 
H3ATCYDDGKTYEVGEQWQKEYLGAICSCTCFGGQRGWRCENCR 
RPGGEP3PEGTTGQSYNQYSQRYHQRTNTNYNCPIECFMPLDVQ 
ADREDSRE 


53SS 


80S6 


703 


- rGGGEGTPGASGKRGPAATTSLVLCIPSVPPPVPFPTLWP 
PPSWRRQPPGGIRRDFSRRLRREANLVATCLPVRASLPHRLNML 
RGPGPGLLLLAVLCLGTAVPSTGASKSKRQAQQMVQPQSPVAVS 
' S VYDNGKHYQINQQWERTYLGNALVCTCYGGSRGFNCESK 
PEAEETCFDKYTGNTYRVGDTYERPKDSMIWDCTCIGAGRGRIS 
CTIANRCHEGGQSYKIGDTWRRPHETGGYMLECVCLGNGKGEWT 
CKPIAEKCFDHAAGTSYWGETWEKPYQGWMMVDCTCLGEGSGR 
ITCTSRNRCNDQDTRTSYRIGDTWSKKDWRGWLLQCI CTGNGRG 
f 1 ^-TS"QTTSSGSGPFTDVRAAVYQPQPHPQPPPYGHCVT 
DSGWYSVGMQLA* KTQGNKCML\ CTCLGNGVS CQETAVTQTYG 
GM - ,bPCVLPFTYNGRTFYSCTTEGRQDGHLWCSTTSNYEQDQ 
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SEQ 
ID 
.NO: 


beginning 
nucleotide 
location 
corresponding 


Predicted end 
nucleotide 
location 
corresponding 
to first 

residue of 


Amino acid segment containing signal peptide 
(A=Alanine, OCysteine, D=Aspartic Acid, R= 
Glutamic Acid, F=Phenylalanine , 3=Glycine, 
H=Histidine, l=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








KYS FC rnHTVLVQTRGGNSNGALCHFPFLYNNHNYTDCTSEGRR 
DNMKWCGTTQNYDADQKFGFCPMAAHEEICTTNEGVMYRIGDQW 
DKQKDMGHMMRCTCVGNGRGEWTCIAYSQLRDQCIVDDITYNVN 
DTFHKRHEEGHMLNCTCFGQGRGRWKCDPVDQCQDSETGTFYQI 
GDSWE KYVHGVRYQCYCYGRGI GE WHCQ PLQTYPS SSGPVEVF I 
TETPSQPNSHPIQWNAPQPSHISKYILRWRPKNSVGRWKEATIP 
GHLNSYTIKGLKPGWYEGQLISIQQYGHQEVTRFDFTTTSTST 
PVTSNT\VTGETTPFSPLVATSESVTEITASSFWSWVSASDTV 
SGFRVEYELSEEGDEPQYLVLPSTATSV\NIP\DLLPGRKYIVN 
VYQ ISEDGEQSLI LSTSQTTAPDAPPDPTVDQVDDTS IWRWSR 
PQAPITGYRIVYSPSVEGSSTELNLP3TANSVTLSDLQPGVQYN 
IT1YAVEENQESTPWIQQETTGTPRSDTVPSPRDLQFVEVTDV 
KVT IMWTP PESAVTG YRVDVI PVNLPGEHGQRLPLSRNTF \AEN 
TGLSPGVTYYFKVFAVSHGRESKPLTAQQTTKL\DAPTNLQFVN 
ETDSTVLVRWTPPPAQITGYRLTVGLTRRGQPRQYNVGPSVSKY 
PLRNLQPASEYTVSLVAIKGNQESPKATGVFTTLQPGSSIPPYN 
TEVTETTIVITWTPAPRIGFKLGVRPSQGGEAPREVTSDSGSIV 
VSGLTPGVEYVYTIQVLRDGQERDAP\IVNK\WTPLSPPTNLH 
LEANPDTGVLTVSWERSTTPDITGYRITTTPTNGQQGNSLEEVV 
HADQSSCTF\DNLEVPGLEYNVSVYTVKDDKESVPISDTIIPAV 
PPPTDLRFTN/ ILGPDTMRVTW\APPPSIDLTNFLVRYSPVKNE 
GRMLQSLSIFFLSDN\AWLTNLLPGTEYWSVS3VYEQHESTP 
\LRGRQKTGLDSP\TGIDFS\DITA\NSFT\VHW\IAPRA/TPI 
TGYRIR\HHPEHF\SGRPREDR\VPHSRNSITLTNLTPGTEYW 
SIVALNGREES?LLIGQQSTVSDVPRDLEWAATPTSLLl\SWD 
APAVTVRYYRITYGETGGNSPVQEFTVPGSKSTATrSGLKPGVD 
YT ITVYAVTGRGDS PASSKP I S INYRTE IDKPS QMQVTDVQDNS 
ISVKWLPSSSPVTGYRVTTT\PKNGPG\PTKTKTAGPDQTEMTI 
EGLQPTVEYWSVYAQNPSGESQPLVQTAVTNIDRPKGLAFTDV 
DVDSIKIAWESPQGQVSRYRVTYSSPEDSIHELFPAPDGEEDTA 
3LQGLRPGSKYTVSVVALHDDMESQPLIGTQSTAIPAPTDLKFT 
QVTPTSLSAQWTPPNVQLTGYRVRVTPKEKTGPMKEINLAPDSS 
SVWSGLMVATKYEVSVYALKDTLTSRPAQGWTTLENVSPPRR 

DVRSYTITGLQPGTDYKIYLYTLNDNARSSPWIDASTAIDAPS 
NLRFLATTPNSLLVSWQPPRARITGYIIKYEKPGSPPREWPRP 
RPGVTEATITGLEPGTEYTIYVIALKNNQKSEPLIGRKKTDELP 
QLVTLPHP.NTLHGPEILDVPSTVQKTPFVTHPGYDTGKrGIQLPGT 
SGQQPSVGQQMIFEEHGFRRTTPPTTATPIRHRPRPYPPNVGQE 
ALSQTTI S WAPFQDTSEYI IS CHPVGTD3EPLQFRVPGTS TSAT 
: 1 •.. I PGA.TYNIIVEALKDQQRHKVREEWTVGNSVNEGLNQPT 
DDSCFDPYTVSHYAVGDEWERMSESGFKLLCQCLGFGSGHFRCD 
SSRWCHDNGVNYKIGEKWDRQGENGQMMSCTCLGNGKGEFKCDP 
H3ATCYDDGKTYHVGEQWQKEYLGAICSCTCFGGQRGWRCDNCR 
RP3GEPSPEGTTGQSYNQYSQRYHQRTNTNVNCPIECFMPLDVQ 
ADREDSRE 


5367 


235 


3591 


KKILNMLCKKNIVIEYLADILYEYLYGFCFSGIKKYLIIHVLRL 
„C_ riTRILLEKSVSLQTQYLLLIVKILSWFPGKEMRHHLQIM 
EVMMRKQDS/RIVGNGSEQQLQKELADVLMDPPMDDQPGEKELV 
KRSQLDGEGDGPLSNQLSASSTINPVPLVGLQKPEMSLPVKPGQ 
GDSEASSP FTPVADEDS WFS KLTYLC-CAS VNAPRS EVEALRMM 
S I LRSQCQI SLDVTLSVPNVS EGIVRLLDPQTNTE IANYP I YKI 
LFCVRGHDGTPESDCFAFTESHYNAELFRIHVFRCEIQEAVSRI 
LYSFATAFRRSAKQTPLSATAAPQTPDSDIFTFSVSLEIKEDDG 
KGYFSAVPKDKDRQCFKLRQGIDKKIVIYVQQTTNKELAIERCF 
GL1LSPGKDVRNSDMHLLDLESMGKSSDGKSYVITGSWNPKSPH 
FQWNEETPKDKVLFMTTAVDLVITEVQEPVRFLLETKVRVCSP 
NERLFWPFSKRSTTENFFLKLKQIKQRERKNNTDTLYEWCLES 
ES3RERRK7TASPSVRLPQSGSQSSVIPSPPEDDEEEDNDEPLL 
SGSGDVSKECAEKILETWGELLSKWHLNLNVRPKQLSSLVRNGV 
P3ALRGEWQLLAGCHHNDHLVEKYRILITKESPQDSAITRDIN 
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Amino acid segment containing signal peptide 
(A=Alanine, C-Cysteine, D=Aspartic Acid, E- 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, l=Isoleucine, K=Lysine, 
L=Leucine, M=Kethionine, N=Asparagine, 
P=Proline, Q=Glutaraine , R=Arginine, 
S=Serine, T= Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 

\=possible nucleotide insertion) 








RTFPAHDYFKDTGGDGQDSLYKICKAYSVYDEEIGYCQGQSFLA 
AVLLLHMPEEQAFSVLVKIMFDYGLRELFKQNFEDLHCKFYQLE 
RLMQEYI PDLYNHFLDISLEAHMYASQWFLTLFTAKFPLY.MVFH 
IIDLLLCEGISVIFNVALGLLKTSKDDLLLTDFEGALKFFRVQL 
PKRYRSEENAKKLMEIACNMKISQKKLKKYEKSYHTMREQQAQQ 
EDP I ERFERENRRLQEANMRLEQENDDLAHELVTS KI ALRKDLD 
NAEEKADALNKELLMTKQKLIDAEEEKRRLEEESAHLKKMCRRE 
LDKAESEIKKNSSIIGDYKQICSQLSERLEKQQTANKVEIEKIR 
QKVDDCERCREFFNKEGRVKG1SSTKEVLDEDTDEEKETLKNQL 
REMELELAQTKL\QLVEAECK1QD\LEHPF*GIjPFNE\VQAA\k 
KTWFNRTLSSIKTATGVQGKETC 


536B 


573 


2014 


GAAAGAAD PRP.G S LGGRTMLD FA I FAVT FLIAL VGAVLYL YPAS 
RQAAGIPGITPT2EKDGNLPDIWSGSLHEFLVNLHERYGPWS 
FWFGRRL WSLGTVDVLKQHINPNKTLD /LF*NHAEVI IKVS IW 
WWQCE * KP \ QRKKLYENGVTDSLKSNFALLLKLPEELLDKWLS Y 
PETQH\VFLSQHMLGFAMKSVTQMVMGSTFEDDQEVIRFQXNHG 
TVWSEIGKGFLDGSLDKNMTRKKOYEDALMQLESVLRNIIKERK 

AIWFLTTSEEVQKKLYEEINQVFGNGPVTPEKIEQLRYCQHVLC 
ETVRTAKLTP VSAQLQD I EGKIDRF 1 1 PRETLVL YALGWLQDP 
NTWPSPHKFDPDRFDDELVMKTFSSLGFSGTQECPELRFAYMVT 
TVLLSVLVKRLHLLSVEGQVIETKYELVTSSREEAWITVSKRY 


53S9 




6622 


PRSLCF3 LWAEAAVLADGGLRRRRRLT iRGTMSAS FVPNGASLED 
CHCNLFCLADLTGIKWKKYV'rtQGPTSAPlLFPVTEEDPILSSFS 
RCLKADVLG/VWRRDQRPERRE\L* I FWGGEDP \VLLTLFTMTY 
QKK1CMECGRMOFPMNAVLCFSKAVHNLLERCLMNRNFVRIGKWF 
VKPYEKDEKP1NKSEHLSCSFTFFLHGDSOTCTSVEINQHQPVY 
LLSEEH I TLAQQSNS P FQVI LCPFGLNGTLTGQAFKMSDSATKK 
LIGEWKQFYPISCCLKEMSE3KQEDMDWEDDSL,AAVEVLVAGVR 
MIYPACFVLVPQSDIPTPSPVGSTHCSSSCLGVHQVPASTRDPA 
MSSVTLTPPTSPEEVQTVDPQSVQKWVKFSSVSDGFNSDSTSHH 
GGKIPRKLANHWDRVWQECNMNRAQNKRKYSASSGGLCEEATA 
AKVASWDFVEATQRTNCSCLRHKNLKSRNAGQQGQAPSLGQQQQ 
ILPKHKTNKKQEKSEKPQKRPLTPFHHRVSVSDDVGMD\ADS\A 
SQRLV\ ISAP\DSQ\VRFSNIR\TNDVAK\TPQMHGTEMANSPQ 
PPPLSP\HPCDWDEGVTKT?STPQSQHFYQMPTPDPLVPSKPM 
EDRIDSL5QSFPPQYQEAVEPTVYVGTAVNLEEDEANIAWKYYK 
FPKKKDVEFLPPQLPSDKFKDDPVGPFGQESVTSVTELMVQCKK 
PLKVSDELVQQYQIKNQCLSAIASDAEQEPKIDPYAFVEGDEEF 
LFPDKKDRQNSEREAGKKHKVEDGTSSVTVLSKEEDAMSIiFSPS 
IKQDAPRPTSHARPPSTSLIYDSDLAVSYTDLDNLFWSDEDELT 
PGSKRSANGSDDKASCKESKTGNLDPLSCISTADlHKMYPTPPS 
LEQHIMGFSPMNMNNKEYGSMDTTPGGTVLEGNSSSIGAQFKIE 
VDEGFCSPKPSEIKDFSYVYKPENCQILVGCSMFAPLKTLPSQY 
LPLIKLPEECIYRQSWTVGKLELLSSGPSMPFIKEGDGSKMDQE 
YGTAYTPQTHTSCGMPPSSAPPSNSGAGILPSPSTPRFPTPRTP 
RTPRTPRGAGGPASAQGSVKYENSDLYSPASTPSTCRPLNSVEP 
ATVPSIPEAHSLYVNLILSESVMNLFKDCNSDSCCICVCNMNIK 

IIGRNTDC5KEAEKRFEALRATSAEHVNGGLKESEKLSDDLILL 
LQDQCTNLFSPFGAADQDPFPKSGVISNWVRVBERDCCNDCYLA 
LEHGRQFMDNMSGGKVDEALVKSSCLHPWSKRNDVSMQCSQDIL 
RMLLSLQPVLQDAIQKKRTVRPWGVQGPLTWQQFHKMAGRGSYG 
TDESPEPLPIPTFLLGYDYDYWLSPFALPYWERLMLEPYGSQR 
DIAYWLCPENEALMTGAKSFFRDLTAIYESCRLGQHRPVSRLL 
TDGIMRVGSTASKKLSEKLVAEWFSQAADGNNEAFSKLKLYAQV 
CRYDLGPYLASLPLDSSLLSQPNLVAPTSQSLITPPQMTNTGKA 
NTPSATLASAASSTMTVTSGVAISTSVATANSTLTTASTSSSSS 
SNLNSGVSSNKLPSFPPFGSMNSNAAGSMSTQANTVQSGQLGGQ 
QTSALQTAC-ISGESSSLPTQPHPDVSESTMDRDKVGIPTDC-DSH 
AVTYPPAIWYIIDPFTYENTDESTNSSSVWTLGLLRCFLEMVQ 
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Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucir.e, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide dele 



iible 



:ide i 



>n) 



TLPPHIKS TVS VQI I PCQYLLQPVKHBDRE I YPQHLKSIiAFSAF 
TQCRRPLPTSTNVKTLTGFGPGLAMETALRSPDRPECIRLYAPP 
FILAPVKDKQTELGETFGEAGQKYNVLFVGYCLSHDQRWILASC 
TDLYGELLETCI INIDVPNRARRKKS 3ARKFGLQKLWEWCLGLV 
QMSSLPWRWIGRLGRIGHGELKDWSCLLSRRNLQSLSKRLKDM 
CRMCGISAADSPSILSACLVAMEPQGSFV1MPDSVSTGSVFGRS 
TTLNMQTSQLNTPQDTSCTHILVFPTSASVQVASATYTTENLDL 
AFNPNNDGADGMGIFDLLDTGDDLDPDIINILPASPTGSPVHSP 
GSHYPHGGDAGKGQSTDRLLSTEPHEEVPNILQQPLALGYFVST 
AKAGPLPDWFWSACPQAQYQCPLFLKASLHIiHVPSVQSDELLHS 
KHSHPIiDSNQTSDVLRFVLEQYNALSWLTCDPATQDRRSCLPIH 
FWLNQLYNFIMNML 

RWSRKLELRRAAQATESRPPQSQEMHPPTGKEVHALKRLRDSAN 
ANDVETVQQLLEaGADPCAADDKGRTALHFASCNGNDQIVQLLL 
DHGADPNQRDGLGNTPLHIiAACTHHVPVITTLLRGGARVDALDR 
AGRTPLHLAKSKLNILQEGHAQCLKAVR/HGGEADHPYAEGVSG 
APRAT*AARCSGVFPSPSRWLGSAPWSRSSCTIWSLPLHEAKCR 
AVRPLSSAAQGSAPSSSSCCTVSTSLALAESLSLFRACTSLPVG 
GCISWL 



lAAI^LHKLLLRSQSCRLCSFRKMRSPPKYRPFLACFTYTTDkor 
SKENTRTVEKLYKCSVDIRKIRR\*KDGYF*RMKPMLKKIiRl/F 
LQELGADETAVASILERCPEAIVCSPTAWTQRKLWQLVCXNEE 
ELIKLIECFPESFFTIKDQENQKLNVQFFQELGLKNWISRI,LT 
AAPNVFHNPVEKNKQMVRIL2ESYLDVGGSEANMKVWLLKLLSQ 
NPFIIjIiNSPTAIKETLEFLQEC2GFTSFEILQLIjSKLKGFLFQLC 
PRSIQNSISFSKNAFKCTDHDLKQLVLKCPALLYYSVPVLEERM 
QGLLREGISIAQIRETPMVLELTPQIVQYRIRKLNSSGYRIKDG 
HLANLNGSKKEF3ANFGKIQAKKVRPLFNPVAPLNVEE 



SPGAQFLWAAPDMPDPLr^ Y I I T~PF~PWLGKGGME 
PLRLLILLFVTELSGAHNTTVFQGVAGQSLQVSCPYDSMKHWGR 
RKAWCRQLGEKGPCQRWSTHNLWLLSFLRRWNGSTAITDDTLG 
GTLTITLRNLQPHDAGLYQCQSLHGSEADTLRKVLVEVLADPLD 
HRDAGDLWFPG\DLRASRMPMWSTAS?GASWKEK3PSHPLPSFS 
SWPASFSSRF*QPAPSGLQPGMDRSQGHIHPVMfTVAMTQGISS 
KLCQG 



vkktksifnsamqemevyvenirrkfgvfnyspfrtpytpnsqy 
qmlldptnpsagtakidkqekvklnfdmtaspkilmskpvlsgg 
tgrrislsdmprspmstnssvhtgsdveqdaekkatsshfsase 
esmdfldkstaspastktgqagslsgspkpfspqeisapittktd 
ktsttgsilnlktldrskaemdlkelsesvqqqstpvplispkrq 
irsrfqlnldktiesckaqlgineisedvytavehsdsedseks 
dssdseyisddeqi<s*gtsqedtedkegcqmdkepsavkkkpkp 
tnpveikeelkstspasekadpgavkdkaspepekdfsgkakps 
phpikdklkgkdetdsptvhlgldsdse\nelvidlgedhsgre 
grknkkepkepspkqdwgktppsttvgshsppetpvltrssaq 
tsaagatattstsstvtvtapapaatgspvkkqrpllpke\tap 
avqrscgtsstvqqkeitqspststitlvtstqssplvtssgsm 
stlvssvngdlpigtasadvaadiakytskl\mdaikgtm\tei 
yndlskn\ttwkaqlaedsqglrieieklqwlhqqel\semkhn 
leltmaemrqsweqerdrliaevkkqlelekqqavdetkk2cqwc 
anfkkeaifyccwntsycdypcq\qahwpeh\mksctqsatapq 
\qeadae\vntetlnkssqgsssstqsapsetasa\skeketsa 



DEKRGS\TRSDHN/TPSTQHGRSLLPGKESRAGTPFLGTSK 



' TKSIFNS.Z lEMHVYVENIRRKFGVFNYSPFRTPYTPNSQY 
QMLLDPTNPSAGTAKIDKQEKVKLNFDMTASPKILMSKPVLSGG 
TGRRISLSDMPKSPMSTNSSVHTGSDVEQDAEKKATSSHFSASE 
ESMDFLDKSTASPASTKTGQAGSLSGSPKPFSPQLSAPITTKTD 
KTSTTGSILNLNLDRSKAEMDLKELSESVQQQSTPVPLISPKRQ 
IRSRFQLNLDKTrESCKAQLGINEISEDVYTAVEHSDSEDSEKS 
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Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=3lycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 








TNPVEIKEELKSTSPASEKADPGAVKDKASPEPEKDFSGKAKPS 
PHPIKDKLKGKD3TDSPTVHLGLDSDSE\NELVIDLGEDHSGRE 
GRKNKKEPKEPSPKQDWGKTPPSTTVGSHSPPETPVLTRSSAQ 
TS AAGATATTS TS STVT VTAPAPAATGS P VKKQRPLL P KE \ TAP 
AVQRSCGTSSTVQQKEITQSPSTSTITLVTSTQSSPLVTSSGSM 
STLVSSVNGDLPIGTASADVAADIAKYTSKL\MDAIKGTM\TEI 
YNDLSKN\TTWKAQLAEDSQ3LRIEIEKLQWLHQQEL\SEMKHN 
LELTMABMRQSW2QERDRLIAEVKKQLELEKQQAVDETKKKQWC 
ANFKKEAIFYCCWNTSYCDYPCQ\QAHWPEH\MKSCTQSATAPQ 
\QEADAE\WTETLNKSSQGSSSSTQSAPSETASA\SKEKETSA 
EKSKESGSTLDLSGSRETPSSILLGSNQGSDHSR\SNKSSWSSS 
DEKRGS\TRSDHN/TPSTQHGRSLLPGKESRAGTPFLGTSK 


5375 


2907 


HIS 


HIFLAEEEPMLERRCRGPIAiMGPAQPRLLSGPSQESPQTLGKES 

RCQA/RGLPLPCPECGRRFRHAPFLALHRQVHAAATPDWGFACH 
LCGQSFRGWVALVLHLRAHSAAKAGPPACPKMARDAFWRRKAAS 
SSILRRCHPSRPRGPRPFICGNCGRSILPTWDQ/LKVAHKRVHV 
SRRP*ERGPPAKVFWGPRPRGPPTGDTPPGPGGDAVDRPF\QCA 
CCGKRFRHK\PNLIRSHAACTSGERPHQ/CSRECG\KRFTNKPY 
LTS\HRRITHTARQPYPCKECGRRFRHKPNLLSHSKIHKRSEGS 
AQAAPGPGSPQLPAGPQESAAEPTPAVPLKPAQEPPPGAPPEHP 
QDPIEAPPSLYSCDDCGRSFRLERFLRAHQRQHTGFRPFTCAEC 
GKNFGKKTHLVAHSRVHSGERPFRLARKCGRRFLPRASQSGGRN 
SAEPNAPRFCPFVCPDCGKAFRIIKPYLAAHRPIATPAEKPYVCP 
DCRKAFSQKSNL\VSHRRIHTGERPYACPDCDRSFSQKSKLITH 
RKSHIRDGAFCCAICGQTFDDEERLLAHQKKHDV 


537S 


4504 


591 


VSTFSLCLWPAGGGGRGRVSNMAQSKRHVYSRTPSGSRMSAEAS 
ARPLRVGSRVEVIGKGHRGTVAYVGATLFATGKWVGVILDEAKG 
KNDGTVQGRKYFTCDEGHGI FVRQSQ IQVFEDGADTTS PETPDS 
SASKVLKREGTDTTAKTSKLRGLKPKKAPTARKTTTRRPKPTRP 
ASTGVAGASSSLGPSGSASAGELSSSEPSTPAQTPLAAP1IPTP 
VLTSPGAVPPLPSPSKEEEGLRAQVRDLEEKLETLRLKRAEDKA 
KLKELEKHKIQLEQVQEWKSXMQEQQADLQRRLKEARKEAKEAL 
EAKERYMEEMADTADAIEMATLDKEMAEERAESLQQEVEALKER 
VDELTTDLEILKAEIEEKGSDGAASSYQLKQLEEQNARLKDALV 
RMRDLSSSEKQEHVK\LQKLMEKKNQELEWRQQRERLQEELSQ 
AESTIDELKEQVDAALGAEEMVEMLTDRNLNLEEXVRELRETVG 
DLEAMNEMNDELQENARETELELREQLDMAGARVREAQKRVEAA 
QETVADYQQTIKKYRQLTAHLQDVNRELTNQQEASVERQQQPPP 
ETFDFKIKFAETKAHAKAIEMELRQMEVAQANRHMSLLTAFMPD 
SFLRPGGDHDCVLVLLLMPRLICKAELIRKQAQEKFELSENCSE 
RPGLRGAAGEQLSFAAIGLVY\SLMPAAGHRYHRY*CHALSQCR 
LD\VYKKVGSLYPEMSAHERS1,DFLIELL.HKDQLDETWVEPLT 
KAIKYYQHLYS IHLAEQPEDCTMQLADHI KFTQSALDCMS VEVG 
RLRAFLQGGQHATDIALLLRDLETSCS\DIRQFCKKIRRRMPGT 
DAPGIPAALAFGPQVSDTLLDCRKHLTWWAVLQEVAAAAAQLI 
APLAENEGLLVAALEELAFKASEQIYGTPSSSPYECEiRQSCWIL 
I STMNK\LVTAMQEGEYDAERPPSKPPP \ VELRAAALRAE I TDA 
EGLGLKLEDRETV1KELKKSLKIKGEELSEANVRLTLLEKKLDS 
AAKnADERIEKVQTRLEETQALLRKKEKEFEETMDAI.QADIDQL 
SAEKAELK2RLNSQSKRTIEGLRGPPPSGIATLVSGIAGEEQQR 
GAIPGQAPGSVPGPGLVKDSPLLLQQISAMRLEISQLQHENSIL 
KGAQMKASLASLPPLHVAKLSHEGPGSELPAGALYRKTSQLLET 
LNQLSTHTHWDITRTSPAAKSPSAQLMEQVAQLKSLSDTVEKL 
XDEVLKEWSQRPGATVPTDFATFPSSAFLRAKEEQQDDTVYMG 
KVTFS CAAGFGQRHRLVLTQEQLHQLHSRLI S 


5377 


762 


1106 


DVPCKRVLPAEAQEXGQLTLSCGESGEEGVF*YHETOQAEGES* 
/WFGPNVRLVHTQLKTKKPSGTLKAKFYSHTGSTKFAARISCTK 
S S *KPGYD3KKGGQYIFI FRGMRWEEQP 


5378 




664 


QASC-TTLRPLPDLPQLKRREATSRNRALKPRGRLVLMTSCLPAL 
RFIATPRLSAMPHIDNDVKLDFKDVLLRPKRSTLKSRSEVDLTR 
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W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








SFSFRNSEQTYSGVPIIAANMDTVGTFEMAKVLCKS*VPGSFWD 
VPQMGCVFLIYKLFTLKWKMLLLSVLLPASILVAEKFSLFTAVH 
KHYSLVQWQEFAGQNPDCLBHLAASSGTGSSDFEQLEQILEAIP 
QVKYICLDVANGYSEHFVEFVKDVRKRFPQHTIMAGNWTGEMV 
EEL I LSGAD 1 1 KVG IGPGSVCTTRKKTGVGYPQLSAVMECADAA 
HGLKGHIISDGGCSCPGDVAKAFGAGADFVMLGGMLAGHSESGG 
ELI ERDGKKYKLF YGMS S * I \AM\ KKYAGGVAE YRASEGKTVEV 
PFKGDVEHTIRDILGGIRSTCTYVGAAKLKELSRRTTFIRVTQQ 
VNPIFSEAC 


5379 


2009 


SS4 


QASGTTLP.PLPDLPQLKRREATSRNRALKPRGRLVLMTSCLPAL 
RFIATPRLSAMPHIDNDVKLDFKDVLLRPKRSTLKSRSEVDLTR 
SFSFRNSKQTYSGVPIIAANMDTVGTFEMAKV1CKS*VPGSFWD 
VPQMGCVFLIYKLFTLKWKKLLLSVLLPASILVAEKFSLFTAVH 
KHYSLVQWQEFAGQMPDCLEHLAASSGTGSSDFEQLEQILEAIP 
QVKYrCLDVAWGYSEHFVEFVEDVRKRFPQHTrMAGNWTGEMV 
EELILSGADIIKVGIGPGSVCTTRKKTGVGYPQLSAVMECADAA 
HGLKGHI I SDGGCS CPGDVAKAFGAGADFVMLGGMLAGHSESGG 
ELI ERDGKKYKLFYGMS S * l\AMVKKYAGGVAEYRASEGKTVEV 
PFKGDVEHTIRDILGGIRSTCTYVGAAKLKELSRRTTFIRVTQQ 
VNPIFSEAC 


5380 


2 


2050 


PSRAGGAERGRAAAARSPGGSAAGWECPSVLDEAGACTMSSCVS 
SQPSSNRAAPQDELGGRGSSSSESQKPCEALRGLSSLSIHLGME 
SFIWTECEPGCAVDLGLARDRPLEADGQEVPLDTSGSQARPHL 
SGRKLSLQERSQ3GLAAGGSLDMNGRCICPSLPYSPVSSPQSSP 
RLPRRPTVESHHVS ITGMQDCVQLNQYTLKDE IGKGSYGWKLA 
YNENDNTYYAMRVLSKKKLIRQAAFPRRPPPRGTRPAPGGCIQP 
RGP I\EQVYQEIA\ I LKKLDHPNW\ KLVEVL\DDPNEDHLYMV 
F\ELVN2GPVMEVPTLKPLSEDQARFYFQDLIKGIEYLHYQKII 
H\RDIKPSNLLVGEDGHIKIADFGVSNEFKGSDALLSNTVGTPA 
FMAPESLSETRKIFSGKALDWAMGVTLYCFVFG*CPFMDERIM 
CLHSKIKSQALEFPDQPDIAEDLKDLITRMLDKNPESRIWPEI 
KLHPWTRHGAEPLPSEDENCTLVEVTEEEVENSVKHIPSLATV 
ILVKTMIRKRSFGNPFEGSRREERSLSAPGNLLTKKPTRECESL 
SELKT*KISPLPACCKVT*EFPHPSGCRPSCWQPPFLIITHSQPR 
*PEPPRTDEALCPYETGRTCWAPLLQVLWWVGTPLPFPLSTSWL 
PDLVGAPGSHFCFLNIALLRYNSHTM 


5381 


2 


2050 


PSRAGGAERGRAAAARSPGGSAAGWECPSVLDEAGACTMSSCVS 
SQPSSNRAAPQDELGGRGSSSSESQKPCEALRGLSSLSIHLGME 
SFIWTECEPGCAVDLGLARDRPLEADGQEVPLDTSGSQARPHL 
SGRKLSLQERSQGGLAAGGSLDMNGRCICPSLPYSPVSSPQSSP 
RLPRRPTVESHHVSITGMQDCVQLNQYTLKDEIGKGSYGWKLA 
YNENDNTYYAMKVLSKKKLIRQAAFPRRPPPRGTRPAPGGCIQP 
RG P I \ EQVYQE I A\ I LKKLDHPNW\ KLVE VL \ DD PNEDHL YMV 
F\ELVNQGPVMEVPTLKPLSEDQARFYFQDLIKGIEYLHYQKII 
H\RDIKPSNLLVGEDGHIKIADFGVSNEFKGSDALLSNTVGTPA 
FKAPE SLSETRKI FSGKALDVWAMGVTLYCFVFG * CPFMDERIM 
CLHSKIKSQALEFPDQPDIAEDLKDLITRMLDKNPESRIWPEI 
KLHPWVTRHGAEPLPSEDENCTLVEVTEEEVENSVKHIPSLATV 
ILVKTMIRKRSFGNPFEGSRREERSLSAPGNLLTKKPTRECESL 
SELKT*KISPLPACCKVT*EFPHPSGCRPSCWQPPFLHTHSQPR 
♦PEPPRTDEALCPYETGRTCWAPLLQVLWWVGTPLPFPLSTSWL 
PDLVGAPGSHFCFLNIALLRYNSHTM - 


5382 


1536 


203 


3ARGSQQDAPALQEAEVRGPERAQPARGRMTKARLFRLWLVLGS 
VFMILLIIVYWDSAGAAHFYLHTSFSRPHTGPPLPTPGPDRDRE 
LTADSDVDEFLDKFLSAGVKQSDLPRKETEQPPAPGSMEESVRG 
YDWSPRDARRSPDQGRQOAERRSVLRGFCANSSLAFPTKERPFD 
DIPNSELSHLIVDDRHGAIYCYVPKVACTNWKRVMIVLSGSLLH 
RGAPYRDPLRIPREHVHNASAHLTFNKFWRRYGKLSRHLMKVKL 
KKYTKFLFVRD P FVRL I SAFRS KFELENEE F/ * PQVRRAHAAAV 
RQPHQPARLGARGLPRWPQWSFANFIQYLLDPHTEKLAPFNEH 
WRQVYRLCHPCQIDYDFVGKLETLDEDAAQLLQLLQVDLAAPLP 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 

residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 

residue of 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutaraine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=3top 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








PELPGTGPPSSWEEDWFAKIPLAWRQQLYKLYEADFVLFGYPKP 
ENLLRD 


5383 


45 


3250 


verllgcrnskrtkrmlisknmpwrrlqgisfgmysaeelkkls 
vksitnpryldslgnpsanglydlalgpadskevcstcvqdfsn 
csghlghielplrvynpllfdklylllrgsclnchmltcpravi 
hlllcqlrvlevgalqavyelerilsrfleenadpsaseireel 
eqytteivqnnllgsqgahvknvcesksklialfwkahmnakrc 
phcktgrswrkehnskltitfpamvhrtagqkdseplgi2eaq 
igkrgyltptsarehlsalwknegfflnylfsgmdddgmesrfn 
p s vf fldflwp psrs r pvsrlgdqmftngqtvnlqavmkdwl 
irkllalmaqeqklpeevatpttdeei^'sliaidrsflstijpgq 
slidklykiwirlqshvnivfdsemdklmmdkypgirqilekke 
glfrkhmkgkrvdyaars vicpdmyintne ig i pmvfatkltyp 
qpvtpwnvqelrqavingpnvhpgasmvinedgsrtalsatomt 
qreavakclltpatgapkpqgtkivcrhvkngdilllnrqptlh 
rps iqahrari lpeekvlrlhyanckaynadfdgdemnahfpqs 
elgraeayvlactdqqylvpkdgqpiagliqdhmvsgasmttrg 
cfftrehymelvyrgltdkvgrvkllspsilkpfplwtgkqws 
tiiliniipedhiplnlsgkakitgkawvketprsvpgfnpdsmc 
esqvli.<egellcgvldkaiiygssayglvhccyeiyggrtsgkv 
ltclarlftaylqlyrgftlgvedilvkpkadvkrqriieesth 
cgpqavraalnlpeaasydevrgkwqdahlgkdqrdfnmidlkf 
keevmhyskeinkacmpfglhrqfpentlqlmvqsgakgstvnt 
mqisci,t,gqielegrstplmasgkslpcfepyeftpraggfvtg 
rfltgikppefffhcmagreglvdtavktsrsgylqrciikhle 
i eltvrdsdgswqflygedgldipktqflqpkqfpfla 
snyevimksqhlhevlsradpkkalhhfraikkwqsfjipntllr 
rgaflsysqkiqeavkalklesenrngr/rpwds/g/rmlrmwy 
sldeesrrkyqkkaaacpdpslsvwrpdiyfasvsetfetkvdd 
ysqewaaqteksyekselsldrlrtllql\kwqrslcepgeavg 
llaaqsigepstqmtlntfh-agrgemwvtlgiprlreilmvas 
aniktpmmsvpvlntkkalkrvkslkkqltrvclgevlqkidvq 
3sfcmeekqnkfqvyqlrfqflphayyqqekclrpedilrfmet 
rffkllmesikkknnkasafrnvntrratqrdldnagelgrsrg 
eqegdeeeeghivdaeaeegdadasdakrkekqeeevdyeseee 
eeregeenddedmqeernphregarktqeqdeevgl/gh*ggpv 
psrppdaapethpqpgapga\eamerrvoavreihpfiddyqyd 
teeslwcqvtvklplmkinfdmsslwsiahgaviyatkgitrc 
llnettnneosiekelvlnteginlpelfkyaevldlrrljysndih 
ai antygi eaalrvieke i kdvfavyg iavdprhls lvadymcf 
egvykplnrfgirsnssplqqmtfetsfqflkqatmlgshdelr 
s psaclwgkwrggtglfelkqplr 


5384 


195 


BBS 


QSCGQRLPTVL*L*GPPGSCPCILSLF\PGRPHALPEIRPYINI 
TILKGDKGDPGPMGLPGYMGREGPQGEPGPQGSKGDKGEMGSPG 
APCQKRFFAFSVGRKTALHSGEDFQTLLFERVFVNLDGCFDMAT 
GQFAAPLRGIYFFSL^^raSW^JYKETYVHIMHNQKEAVILYAOPS 
ERSIMQSQSVMLDLAYGDRWVRLFKRQRENAIYSNDFDTYITF 
SGHLIKAEDD 


5385 


32S 


799 


LMVPRTKKEAPAPPKAEAKAKAL\KAKKAVLKDVESHKKNKIHM 
SPTFRRPKTL*LRRQPKYPWKSTPRRNKLDHHVIIKFPLTTE*A 
iri"NSLLVFTVDVKANKHQIKQAVKK/LCDIDVAKVNTLIQ 
SDGERKAYVRIAPDYDALWATKIGIT 


5386 


326 


799 


LMVPRTK.KEAPAPPKAEAKAKAL\KAKKAVLKDVHSHKKNKIHM 
S PTFRR P KTIi* LRRQ P KYPWKS TPRRNKLDHHVI I KFPLTTE * A 
VKK I EN.\ T S LLVFTVDVKANKHQ IKQAVKK/LCDIDVAKVNTL IQ 
SDGERKAYVRrAPDYDALWATKIGIT 


5387 


2 


2117 


FWAASC-GCWFVLGERRAGSLLSASYGTFAMPGMVLFGRRWAIA 
SDBLVFPGFFELWRVLWWIGILTLYLMHRGKLDCAGGALLSSY 
LIVLMILLAWICTVSAIMCVSMRGTICNPGPRKSMSKLLYIRL 
ALFFPEMVWAS LGAAWVADG VQCDRTWNG 1 1 ATVWS W 1 1 IAA 
TWSIIIVFDPLGGKMAPYSSAGPSHLDSHDSSQLLNGLKTAAT 
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SEQ 
ID 


nucleotide 

corresponding 
to first 
amino acid 
residue of 
amino acid 


Predicted end 
nucleotide 

corre sponding 
to first 
amino acid 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G-Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 








: v ETRI XLLCCCIGKDDHTRVAFSS TAELFS^YFSDTDLVPSD 
IAAGLALLHQQQDNIRNNQEPAQWCHAPGSSQEADLDAELKNC 
HHYMQFAAAAYGWPLYIYRNPLTGLCRIGGDCCRSKWPQTMT/M 
VGGDQLQL/CTSAPILHTHRAAVQGLHPRQLPWTRFTELPFIjVA 
LDHRKE3VWAVRGTMSLQDVLTDLSAESEVLDVECEVQDRLAH 
KGISQA^YVYQRLINDGILSQAFSIAPEYRLVIVGHSLGGGAA 

KDVI PRLSVTNLEDLKRR I LRWAHCNKPKYKI LLHGLWYELFG 
3NPNNLPTELDGGDQEVLTQPLLGEQSLLTRWSPAYSFSSDSPL 
DSSPKYPPLYPPGRIIHLQEEGASGRFGCCSAAHYSAKWSHEAE 
FSKILIGPKMLTDHMPDILMRALDSVVSDRAACVSCPAQGVSSV 
DVA 


5388 


1569 


753 


TADGGAGGGGRRQA3VRRHYLYPFTGGYRRRRAACQAERPAARS 

TNGMCSVCYKEHLQRQNSSNGRISPPVQCTDGSVPEAQSALDST 
SSSMQPSPVSNQSLLSESVA3SQLDSTSVDKAVPETEDVQASVS 
DTAQQPSEEQSKSLE\NRNKKRIAVSCAGRKWDLLGLNAGVEMF 
TWYTVTQMYTIALTITKOMLKNFVFOQEFKSFGSFHQQLLEYK 
II.EHLQTKN 


5389 


15S9 


753 


TAUGGAGGGGRRQAGVRRHYLYPFTGGYRRRRAACQAERPAARS 
KDTDLAAYQKGNLGVQLRNMAQETNHSQVPMLCSTGCGFYGNPR 
TNGMCSVCYKEHLQRQNSSNGRISPPVQCTDGSVPEAQSALDST 
~ - i 1 ' 1 - 1 ^ JZjLSESVASSQLDSTSVDKAVPETEDVQA8V8 
DTAQQPSEEQSKSLB\NRNKXRIAVSCAGRKWDLLGLNAGVEMF 
TWYTVTQMYTIALTITKQMjKNFVFQQEFKSFGSFHQQLLEYK 
ILEHLQTKN 


5390 


217 


1332 


F.nPRKLMEDKMWSECEGPEMSLVCLTDFQAHAREQLSKSTRDFI 
EGGADDSITRDDNIAAFKRIRLRPRYLRDVSEVDTRTTIQGEEI 
3APICIAPTGFHCLVWPDGEMSTARAAQAA\GICYITSTFASCS 
LEDIVIAAPEGLRWFQLYVHPDLQLNKQLIQRVESLGFKALVIT 
LDTPVCGNRRHDIRNQLRRNLTLTDLQSPKKGNAIPYFQMTPIS 
TSLCWNDLSWFQSITRLPI1LKGILTKEDAELAVKHNVQGIIVS 

ALALGAKCI FLGDAI LWALASKGEHGVKEVLN I LTNEFHTSMA\ 
LTGCRSVAEINRNLVQFSRL 


5391 


1 


1292 


VKKAAGRSRGPPTAGGQRCEEAPGTVMERRLGVRAWVKENRGSF 
QPPVCNKLMIIQEQLKVMFVGGPNTRKDYHIEEGEEVFYQLEGDM 
VLRVLEQGKHRDWIRQGEIFLLPARVPHSPQRFANTVGLWER 
RRLETELDGLRYYVGDTMDVLFEKWFYCKDLGTQLAPIIQEFFS 
SEQYRTGKPIPDQLLKEPPFPLSTRSIMSPMSLDAWLDSHHREL 
QAGTPLSLFGDTYETQVIAYGQGSSEGLRQNVDVWLWQLEGSSV 
VTMGGRRLSLGPWMDSLLVLSWGPSY\AW\ERTQGSVALSVT\Q 
DPACKKSPWGEPSCHGLKAATGVPSTLEVPSLPNMSPSPHYLSV 
YCRCVPHRPAHCCHPPSCPSQPRCHAPGRAAAPHLLWQTQPTAL 
PVLPGGLPPAPLLPIPLSLQTQCSTSTPRRPSIKAS 








IRGSNAOF"'" 3ASG G PQPDPAG! GVPALAAAVLGACEPR 
CAAPCPLPALSRCRGAGSRGSRGGRGAAGSGDAAAAAEWIRKGS 
FrHKPAHGWLHPDARVLGPGVSYWRYMGCIEVLRSMRSLDFMT 
RTQVTREAINRLHEAVPGVRGSWKKKAPNKALASVLGKSNLRFA 
GMSISIHISTDGLSLSVPATRQVIANIIHMPSISFASGGDTBMTD 
YVAYVAKDP INQRACHI LECCEGL\AQS 1 I STVGQAFELRFKQ Y 
LHSPPK\ r ALPPERIiAGPEESAWGDEEDSLEHNYYNSIPGKEPPL 
GGLVDSRIiALTQPCALTALDQGPSPSLRDACSIiPWDVGSTGTAP 
PGDGYVQADARGPPDHEEHLYVNTQGLDAPEPEDSPKKDLFDMR 
PFEDALKLHECSVAAGVTAAPLPLEDQWPSPPTRRAPVAPTEEQ 
LRQEPWYHGRMSRRAAERMLRADGDFLVRDSVTNPGQYVLTGMH 
AGQPKHIiLLVDPEGWRTKDVLFESISHLIDHHLQMGQPIVAAE 
SELHLRGWSREP 


5393 


2 


982 


1 HI J ;i'SQNVCPRNLWLLQPLTVLI J LLASADSQAAAP 
PKAVLKLSPPWIK^/LQXEDSVTLTCQGAPQP/ERSDSIQWFHNG 



310 



WO 01/53312 



PCT/US00/34263 



SEQ 
ID 
NO: 


Predicted 
nucleotide 
corresponding 

residue of 
sequence 


Predicted end 

nucleotide 

location 

to first 9 

residue of 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenyl alanine, G=Glycine, 

L=Leucir.e, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y= Tyrosine, X=Unknown, *=S~op 
Codon, /=possible nucleotide deletion, 








\NL I PTHTQPS \ YRFKANNN\DSGEYTCQTGQTSL\SDPVHLTV 

SQKFSHLDPTFSIPQANHSHSGDYHCTGNIGYTLFSSKPVTITV 
QVPSMGSSSPMGIIVAWIATAVAAIVAAVVALIYCRKKRrSAN 
STDPVKAAQFEPPGRQMIAIRKRQLEETNNDYETADGGYMTLNP 
RAPTDDDKNI YLTLPPNDHVNSNN 


5394 


2 


982 


GGDSAGMTMETQMSQNVCPRNLWLLQPLTVLLLLASADSQAAAP 
PKAVLKLEPPWINVLQ\EDSVTLTCQGAPQP/ERSDSIQWFHNG 
\NLIPTHTQPS\YRFKANNN\DSGEYTCQTGQTSL\SDPVHLTV 
LSEWLVLGTPHLEFQEGETIMLRCHS\WRDKP\LVKVTFFQNGK 
SQKFSHLDPTFSIPQANHSHSGDYHCTGNIGYTLFSSKPVTITV 
QVPSMGSSEPMGIIVAWIATAVAAIVAAWALIYCRKKRISAN 
STDPVKAAQFEPPGRQMIAIRKRQLEETNNDYETADGGYMTLNP 
RAPTDDDKNI YLTLPPNDHVNSNN 


5395 


3135 


531 


rasdaknqegllntrrkstdsvpiskstlsrslsliqasdfdgas 
s sgnpeavalapdaystgs s sass tlkrtkkpp.p pslkkkqttk 
kptetppvketooepdeeslvpsgenlasetktesaktegpspa 
lleetplepaagpkaacpldsesvegwppasgggrvqnsppvg 
rktlplttapeagevtpsdsggqedspakghsvr1efdysedks 
swdnqqsnppptkkigkkpvakmplrrpkmkktpekldntpasp 
prspaepndipiakgtytfdidkwddpnfnpfsstskmqespkl 

angvdgdglnkpakkkktplktdtfrvkkspkrsplsdppsqdp 
tpaatpetppvisavvhatdeeklavtnqkwtcmtvdleadkqd 
ypqpsdlstfvwetkfsspteeldyrksyeieymekigsslpqd 
ddapkkqalylmfdtsqespvksspvrmsesptpcsgssfeete 
alwtaaknqhpvprgtapnqeshlqvpekssqkeleamglgtp 
seaieitapegsfasadallsrlahpvslcgaldylepdlaekn 
pplfaqklqreaahptdvsisktalysrigtaevekpagllfqq 
pdldsalqiaraeiitkerevsewkdkyeesrkevmemrkivae 
yektiaqmiedeqr3ksvs\hqtvqqlvlekeqa\ladlnsvek 
nsladlfrryekmkevlegfrkneevlkrcaqeylsrvkkeeqr 
yqalkvha\eekldranae\iaqvrgkaqqeqaahqaslaerss 
crv\dalertleqknkeieeltkicdeliakmgks 


53 95 


3135 


531 


RASDAKNQEGLLNTRRKSTDSVPISKSTLSRSLSLQASDFDGAS 
SSGNPEAVALAPDAYSTGSSSASSTLKRTKKPRPPSLKKKQTTK 
XPTETPPVKETQQEPDEEELVPSGENLASETKTESAKTEGPSPA 
LLEETPLEPAAGPKAACPLDSESVEGWPPASGGGRVQNSPPVG 
RKTLPLTTAPEAGEVTPSDSGGQEDSPAKGHSVRLEFDYSEDKS 
SWDNQQENPPPTKKIGKKPVAKMPLRRPKMKKTPEKLDNTPASP 
PRSPAEPNDIPIAKGTYTFDIDKWDDPNFNPFSSTSKMQESPKL 

ANGVDGDGLNKPAKKKKTPLKTDTFRVKKSPKRSPLSDPPSQDP 
TPAATPETPPVISAWHATDEEKLAVTNQKWTCMTVDLEADKQD 
YPQPSDLSTFVNETKFSSPTEELDYRNSYEIEYMEKIGSSLPQD 
DDAPKKQALYLMFDTSQESPVKSSPVRMSESPTPCSGSSFEETE 
ALVNTAAKNQHPVPRGLAPNQESHLQVPEKSSQKELEAMGLGTP 
SEAIEITAPEGSFASADALLSRLAHPVSLCGALDYLEPDLAEKN 
PPLFAQKLQREAAHPTDVSISKTALYSRIGTAEVERPAGLLFQQ 
PDLDSALQ IARAE I ITKEREVSEWKDKYEESRREVMEMRKI VAE 
YEKTIAQMIEDEQREKSVS\HQTVQQLVLEKEQA\LADLNSVEK 
\SLADLFRRYEKMKEVLEGFRKNEEVLKRCAQEYLSRVKKEEQR 
YQALKVHA\EEKLDRANAE\IAQVRGKAQQEQAAHQASLAERSS 
CRV\DALERTLEQKIJKEIEELTKICDELIAKMGKS 


5397 


3135 


531 


RASDAKNQ3GLLNTRRKSTDSVPISKSTLSRSLSLQASDFDGAS 
SS3NPEAVALAPDAYSTGSSSASSTLKRTKKPRPPSLKKKQTTK 
KPTETPPVXETQQEPDEESLVPSGENLASETKTESAKTEGPSPA 
LLEETPLEPAAGPKAACPLDSESVEGWPPASGGGRVQNSPPVG 
RXTLPLTTAPEAC-EVTPSDSGGQEDSPAKGHSVRLEFDYSEDKS 
SWDNQQENPPPTKKIGKKPVAKMPLRRPKMKKTPEKLDNTPASP 
PR3PAEPNDIPIAKGTYTFDIDKWDDPNFNPFSSTSKMQESPKL 
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: SEQ 
ID 
NO: 


beginning 
nucleotide 

corresponding 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








DT 1 PICT i- r o Ell E 
ANGVDGDGLNKPAKKKKTPLKTDTFRVKKSPKRSPLSDPPSQDP 
TPAATPETPPVISAWHATDEEKLAVTNQKWTCMTVDLEADKQD 
YPQPSDLSTFVNETKFSSPTEELDYRNSYEIEYMEKrGSSLPQD 
DDAPKKQALYLMFDTSQESPVKSSPVRMSESPTPCSGSSFEETE 
ALVNTAAXNQHPVPRGLAPNQESHLQVPEKSSQKELEAMGLGTP 
SEAIE ITAPEGS FASADALLSRIAHPVSLCGALDYLEPDLAEKN 
PPLFAQKLQREAAHPTDVSISKTALYSRIGTAEVEKPAGLLFQQ 
PDLDSALQ IARAEIITKEREVSEWKDKYEESRREVMEMRKIVAE 
YEKTIAQMIEDEQREKSVS\HQTVQQLVLEKEQA\LADLNSVEK 
\SIADLFRRYEKMKEVLEGFRKNEEVLKRCAQEYLSRVKKEEQR 
YQALKVHA\EEKLDRANAE \ IAQVRGKAQQEQAAHQASLAERSS 
CRV\DALERTLEQKNKEIEELTKICDELIAKMGKS 


5398 


56 


54 2 G 


SGEVCRMESNFNQEGVPRPSYVFSADPIARPSEINFDGIKLDLS 
HEFSLVAPNTEAMSFESKDYLQVCLRIRPFTQSEKELESEGCVH 
ILDSQTWLKEPQCILGRLSEKSSG\QM\AQKFSFFPGFLGPAT 
TQKEFFQGCIMHP\VKDLLKGQSRLIFTYGLTNSGKTYTFQGTE 
ENIR1LPRTLNVLFDSLQERLYTKMNLKPHRSREYLRLSSEQEK 
r - 1 ?I SALLRCIKEVTVHNDSDDTLYGSLTNSLK.TSEFEESIK 
DYEQANLNMANSIKFSVWVSFFEIYNEYIYDLFVPVSSKFQKRK 
MLRLSQDVKGYSFIKDLQWIQVSDSKEAYRLLKLGIKHQSVAFT 
KLNNASSRSHSIFTVKILQIEDSEMSRVIRVSELSLCDLAGSER 
TMKTQKEGERLRETGHINTSLLTLGKCINVLKNSEKSKFQQHVP 
FRESKLTHYF/QSFFNGKGKICMIVNISQCYLAYDETLNVLKFS 
[ i 7PDTLNSSQEK1FGPVK3SQDVSLDSNSNSKILNVKR 
ATISWENSLEDLMEDEDLVEELENAEETED/V3ETKLLDEDLDK 
TLEENKAFISHEEKRKLLDLIEDLKKKLINEKKEKLTLEFKIRE 
EVTQEFTQYWAQREADFKETLLQEREILEENAERRLAIFKDLVG 
KCDTREEAAKDICATKVETEEATACLELKFNQIKAELAKTKGEL 
IKTKEELKKREWESDSLIQELETSWKKI ITQNQRIKELINIIDQ 
KEDTINEFQNLKSHMENTFKCNDKADTS SL I INNKLI CNETVEV 
PKDSKSKICSERKRVNENELQQDEPPAKKGSIHVSSAITEDQKK 
SEEVRPNIABIEDIRVLQENNEGLRAFLLTIENELKNEKEEKAE 
LKKQI VHFQQELSLSEKKNLTLS KEVQQ I QSNYD IAI AELHVQK 
SKHQEQEEKIMKLSNEIETATRSITNNVSQIKLMHTKIDELRTL 
DSVSQISKIDLLNLRDLSNGSEEDNLPNTQLDLLGNDYLVSKQV 
KEYRIQEPNRENSFHSSIEAIWEECKEIVKASSKKSHQIEELEQ 
QIEKLQAEVKGYKDENNRLKEKEHKN3DDLLKEKETLIQQLKEE 
LQEKNVTLDVQIQHWEGKRALSELTQGVTCYKAKIKELETILE 
TQKVERSHSAKLEQDILEKESIILKLERNLKEFQEHLQ.DSVKNT 
KDLNVKELKLKEEITQLTNNLQDMKHLLQLKEEEEETNRQETEK 
LKESLSASSART!?N\LNADLQRKEEDYADLKEKLTDAKKQIKQV 
QKEVSVMRDEDKLLRIKINELEKKKNQCSQELDMKQR\TIQQLK 
EQLINQKVEEAIQQYERACKDLNVKEKIIEDMRMTLEEQEQTQV 
EQDQVLXEAKLEEVERLATELDRWRVKCNDLETKNNQRSNKEHE 
KKTl'/LQKLTNLQDELQESEQKYNADRKKWLEEKMMLITQAKEA 
EKIRNKEMKKYABDRERFFKQQNEMEILTAQLTEKDSDLQKWRE 
ERDQLVAALEIQLKAIiISSNVQKDNEIEQLKRIISETSKISTQI 
MDIKPKRISSADPDKLQTEPIiSTSFEISRNKIEDGSWLDSCEV 
STENDQSTRFPKPELEIQFTPLQPNKMAVKHPGCTTPVTVKIPK 
ARKRKSNEMEEDLVKCEWlOOrATPRTMLKFPISDDRNSSVXKEQ 
KVAIRPSSKKTYSLRSQASIIGWLATKKKEGTLQKFGDFLQHS 
PSILQSKAKKIIETMSSSKL3NVEASKENVSQPKRAKRKLYTSE 
ISSP I DI SGQVILMDQKMKESDHQI I KRRLRTKTAK 


5399 




230 


GPEFAXFLSQDQINEYKECFSLYDKQQRGKIKATDLMVAMRCLG 
ASP7PGEVQRHLQTHGIDGNGELDFSTFLTIMHMQIKQEDPKKE 
ILLMaLMVDKEKKGYVMASDLRSKLTSLGEKLTHKEV\DDLFRE 
\ADIEPNGKVKYDEFIHKITSYLDGTY 


5400 


931 


248 


SKCSSGMEIPPTNYPASRAALVAQNYINYQQGTPHRVFEVQKVK 
QASMED I PGRGHK YRLKFAVEEI I QKQVKVNCTA3 VLYPS TGQE 
TA?EVNFTFEGETGKNPDEEDNTFYQRLKSMKEPLEAQNI\PDN 
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corresponding 



=id segment containing signal peptide 
Lne, C=Cysteine, D=Aspartic Acid, I 
: Acid, F=Phenylalanine, G=Glycine, 



H=Histidine, I=Isol« 
L=Leucine, M=Methion: 
P=Proiine, Q=Glutamir 
S=Serine, T=Threonin<E 
W^Tryptophan, Y-Tyros 
Codon, ,/=poss 



K=Lysir 
ne, N=Asparagine, 

ine, X=Unknown, *=Stop 
leotide deletion, 



\=pcssible nucleotide in sertic 
FGNVS PEMTLVLHLAWVACG Y 1 1 WQNSTEDTWYKMVKI QTVKQV 
QRNDDF I EU3YT ILLHNIASQE 1 1 PWQMQVLWHPQYGTKVKHNS 
RLPKEVQLE 

TGWSYGPTTSLAFLAPRDFPFPPKLLIHPQAWRLSCGAGSMGS 
QAAAEWRNWASWEGSSSLSGCSMGCFKDDRIVFKTWMFSTYFME 
KWAPRQDDMLFYVRRKLAYSGSESGADGRKAAEPEVEVEVYRRD 
SX'KTiPGLGDPDIDWEESVCLNLILQKIjDYMVTCAVCTRADGGDI 
HIHKKKSQQVFASPSKHPMDSKGEESKISYPNIFFMIDSF\EE\ 
V?3DMTVGKGEMVCVELVASDKTOTF^GVIFQGSIRYEALKKVY 
DNRVSVAAR!4AQK\MSFGF£KYSNMEF\VR\MKGPQGKGHAEMA 

vsrvstgdtspcgteedsspaspmhervtsfstpptpernnrpa 
ffspslkrkvprnriaemkkshsandseeffr3ddggadlhnat 
nlrsrslsgtgrslvgswlklnradgnfllyahltyvtlplhri 

ltdilevrqkpilmt 

gecfimaawqqndlvfefasnvmederqlgdpaifpavivehv 
pgadilnsyaglacveepndmitessldvaeeeiidddddditl 
tveaschdgdetietieaaeallnmdspgpmldekrin1jnifss 
peddmwapvthvsvtldgipevmetqqvqekyadspgasspeq 
pkrkkgrktkpprpdspattpnisvkkkmicdgkgntiylwefll 
allqdkatcpkyikwtqrek3iflojvdskpvsriiwrkhknkp\d 
mnyepmgralryyyqrg i lakvegqrlvyqfkempkdli yinde 
dpsssiesi ) frnqtsrsrvssspgvkggattvlk 
pgnskaakpkdpvevaqpsevlrtvqptqspyptqlfrtvhwq 
pvqavpegeaartstmqdetlnssvqsirXtiqaptqvpvwsp 
rnqq\lhtvtlqtvplttviastdpsagtgsqkfilqaipssqp 
ktvlkenvmlqsqkags pps tvlgparv\qqvltsnvqti cngt 
vs v\as sfsfs \ atap wtlfllgs s qlvahppgtvitsvi ktq 
etktltqevekkesedhlkentekteqqpqpyvmvvsssngfts 
qvamkqnellepnsf 
gecfimaawqqndlvfefasnvmederqlgdpaifpavivehv 
pgad i lns yaglac ve epndmitessldvaeeei idddddditl 
tveaschdgdetietieaaeallkmdspgpmldekrinknifss 
peddmwapvthvsvtldgipevmetqqvqekyadspgasspeq 
pkrkkgrktkpprpdspattpnisvkkknkdgkgntiylwefll 
allqdkatcpkyikwtqrekgifklvdskpverlwrkhknkp\d 

MNYEPMGRALRYYYQRGILAKVEGQRLVYQFKEMPKDLIYINDE 
DFSSSIESSDPSLSSSATSNRNQTSRSRVSSSPGVKGGATTVLK 
PGRSKAAKPKDPVEVAQPSEVLRTVQPTQSPYPTQLFRTVHWQ 
PVQAVPEGEAARTS TMQDETLNSSVQS I R\T1QAPTQVPWVS P 
RNQQ\LHTVTLQTV?LTTVIASTDPSAGTGSQKFILQAIPSSQP 
MTVLKEKVMLQSQKAGSPPSIVLGPARV\QQVLTSNVQTICNGT 
VSV\ASSPSFS\ATAPWTLFLLGSSQLVAHPPGTVITSVIKTQ 
ETKTLTQEVEKKESEDHLKENTEKTEQQPQPYVMWSSSNGFTS 
QVAMKQNELLEPMSF 
LPVTLI FAKMKTLQSTLLLLLLVPLI KPAPPTQQDS RI I YDYGT 
DNFEESIFSQDYEDKYLDGKNIKEKETVIIPNEKSLQLQKDEAI 
TPLPPKKENDEMPTCLLCVCLSGSVYCEEVDIDAVPPLPKESAY 
LYARFWKIKKLT\AKDFADIPNLRRLDFTGNLIEDIEDGTFSKL 
SLVEELSLAEWQLLKLPVLPPKLTLFNAKYNKIKSRGIKANAFK 
KLNNLTFLYLDHNALESVPLNLPESr.RVIHLQFNNIASlTDDTF 
CKAMDTSYIRDRIEEIRLEGMPIVLGKHPM5FICLKRLPI GSYF 
QNSRSLHMDPQNQHGSGSSLWIQQPSLDSRPRLDYEREIQPTA 
ILSLDQIKAIRGSNEYTEGPSWKRPAPRTAPRQEKHERTHEII 
PINVHNNYEHRHTSHLGHAVLPSWARGPILSRSTSTGSAASSGS 
NSSASSEQGLLGRSPPTRPVPGHRSERAIRTQPKQLIVDDLKGS 
LKEDLTQHKFICEQCGKCKCGECTAPRTLPSCKACNRQCLCSAE 
SMVEYGTCMCL\VKGIFYHCSNDDEGDSYSDKPCSCSQSHCCSR 
YLCMGAMSLFLPCLIjCYPPAKGCLKLCRRCYDWIHRPGCRCKNS 

NTVYCKLESCPSRGQGKPS 

P.MRTYNVEaPLTFMDVAIEPCLEEWQCIiDTAQQNLYRNVMLENY | 
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Predicted 
beginning 

location 
corresponding 
to first 
:no acid 



Predicted end 
nucleotide 
location 
corresponding 



Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Kethionine , N=Asparagine , 
P=Proline, Q=C-lutamine , R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 

\=possible nucleotide insertion) 

RNLVFLG/IIAVSKPDLITCLEQEKEPWEPMRRHEMVAKPPVMC 
SHFTQDFWPEQHIKDPFQKATLRRYKNCEHKNVHLKKDUKSVDE 
CKVHRGGYNGFNQCLPATQSKIFLFDKCVRAFHKFSNSNRHKIS 
HTEKKLFKCKECGKSFCMLSHLAQHKIIHTRVNFCKCEKCGKAF 
NCPSIITKHKRINTGEKPYTCEECGKVFNWSSRLTTHKKWYTRY 
KLYKCEECGKAFNKSSILTTHKIIRTGEKFYKCKECAKAFNQSS 
NLTEHKKIHPGEKPYKCEEC3KAFNWPSTLTKHKRIHTGEKPYT 
CEECGKAFNQFSNLTTHKRIKTA\EKFYKCTECGEAFSRS\SNL 
TKHKEIHTEKKPYKCBECGKAFKWSSKLTBHKLTHTGEKPYKCE 
KCGKAFNCPSIITKBNRINT3EKPYTCEECGKVFNWSSRLTTHK 
KNYTRYKLYKCEECGKAFNKSSILTTHKKIHIEKKFYKCEECGK 
AFKWSSKLTEHKITHTGEKPYKCEECGKAFNHFSILTKHKRIHT 
GBKPYKCEECGKAFTQSSNLTTHKKIHTGEKFYKCEECGKAFTQ 
SSNLTTHKKIHTGGKPYKCEECGKAFMQFSTLTKHKIIHTEEKP 
YKCEECGKAFKWSSTLTKHKIIHTGEKPYKCEECG\KAFKLSST 
LSTHKIIHTGEKPYKCEKCGKAFNRPSNLIEHKKIHTGEQPYKC 
EECGKAFNYSSHLNTHKRIHTKEQPYKCKECGEAFNQYSNIjTXH 

nkihtgzklykpedvtvilttpqtfsni k 

rprrrqs scctgwlagwllraapr fcrr tetdmeqgkglavl i l 

aiillqgtlaqsikgnhlvkvydyqedgsvlltcdaeaknitwf 

kdhkmigi'ltedkkkwnlggnakdprgmyqckgsqnkskplqvy 

yrmcqncielnaatisgflfaeivsifdlavgvyfiagtgmefr 

qs \rasdkqtllp\ndpaptqplkdprkmtqyshl qgn \qlrrn 

qcskgtciipqaqqpwdegvwqeapsqsepwgqsqepptmpqrlp 

HARQHTPLPLGSADYRRWSVRPQGPHRDPKDSRDAAKKEQGSL 
APRPVPASRGGKTLCKGYRQAPPGPPAQFQRP icsas ppwasrf 
STPCPGGAVREDTYPVGTQGVPSLALAQGGPQGSWRFLEWKSMP 
RLPTDLDrGGPWFPHYDFERSCWVRAISQEDQLATCWQAEHCGE 
VRNKDMSWPEEMSFIANSSKIDRHKVPTEKGATGLSNLGNTCFM 

nssiqcvsntqpltqyfisgrhlyelnrtnpigmkghmakcygd 
lvqelwsgtqknvaplklrwtiakyaprfngfqqqdsqellafl 
ldglhedlnrvhekpyvelkdsdgrpdwevaaeawdnhlrrnrs 
iwdlfhgqlrsqvkcktcghisvrfdpfwflslplpmdsymhl 
eitvikldgttpvryglrlnmdekytglkkqlsdlcglnseqil 
laevhgsniknfpqdnqkvrlsvsgflcafeipvpvspisassp 
tqtdfsss pstnemftlttngdlprp i f i pngmpnt wpcgtek 
wftngmvwghmpslpdspftgyiiavhrkmmrtelyflssqknr 
pslfgmplivpctvhtrkkdlydavwiqvsrlasplppqeasnh 
aqdcddsmgyqypftlrwqkdgnscawcpwyrfcrgckidcge 
drafignayiavdwhptalhlryqtsqerwdehesveqsrraq 
vepinldsclraftseeelgenemyycskckthclatkkldlwr 
lppiliihlkrfqfvngrwiksqkivkfpresfdpsaflvprdp 
alcqhkpltpqgdelseprilarevkkvdaqssageedvllsks 



VLGGSQPELVTPQDHEVALANGFLYEHEACGNGCGNGYSNGQLG 
KHSEEDSTDDQREDTRIKPIYNLYAISCHSGILGGGHYVTYAKN 
PNCKWYCYNDSSCKELHPDEIDTDSAYILFYEQQGIDYAQFLPK 

TDGKKMADTSSMDEDFESDY\EKYCVLQ 

QGSKGTCHPQAQQPWDEGVWQEAPSQSEPWGQSQEPPTMPQRLP 
HARQHTPLPLGSADYRRWSVRPQGPHRDPKDSREAAKREQGSL 
APRPVPASRGGKTLCKGYRQAPPGPPAQFQRPICSASPPWASRF 
STPCPGGAVREDTYPVGTQGVPSLALAQGGPQGSWRFLEWKSMP 
RLPTDLDIGGPWFPHYDFERSCWVRAISQEDQLATCWQAEHCGE 
VRNKDMSWPEEMSFIANSSKIDRHKVPTEKGATGLSNLGNTCFM 
NSSIQCVSNTQPLTQYFISGRHLYELNRTNPIGMKGHMAKCYGD 
LVQELWSGTQKNVAPLKLRWTIAKYAPRFNGFQQQDSQELLAFL 
LDGLHEDL1IRVHEKPYVELEODSDGRPDWEVAAEAWDNHLRRNRS 
IWDLFHGQLRSQVKCKTCGHISVRFDPFNFLSLPLPMDSYMHL 
EITVI KLDGTTP VR YGLRLNMDEK YTGLKKQLS DLCGLNS EQIL 
LAEVHGSNIKNFPQDNQKVRLSVSGFLCAFEIPVPVSPISASSP | 
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SEQ 
ID 


Predicted 
beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amine acid 
sequence 


Predicted end 
nucleotide 

corre sponding 

amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cystelne, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptcphan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








rQTDFS £ SFS TNEMFTLTTNGDLPRP I FI PNGMPNTWP CGTEK 

PSLFGMPLIVPCTVHTRKKDLYDAVWIQVSRIiASPLPPQEASKH 
AQDCDDSMGYQYPFTLRWQKDGNSCAWCPWYRFCRGCKIDCGE 
DRAFIGNAYIAVDWHPTALHLRYQTSQERWbEHESVEQSRRAQ 
VEPINLDSCLRAFTSEEELGENEMYYCSKCKTHC^ATKKLDLWR 
LPPILIIHLKRFQFVNGRWIKSQKIVKFPRESFDPSAFLVPRDP 

PSSLSANIISSPKGSPSSSRKSGTSCPSSKNSSPNSSPRTLGRS 
KGRLRLPQIGSKNKLSSSKENLDASKEN3AGQICELADALSRGH 
VLGGSQPELVTPQDHEVALANGFLYEHEACGNGCGMGYSNGQLG 
NHSEEDSTDDQREDTR I KP I YNLYAI SCHSG I LGGGHYVTYAKN 
PNCKWYCYNDSSCKELHPDEIDTDSAYILFYEQQGIDYAQFLPK 
TDGKKMADTSSMDEDFESDY\EKYCVLQ 


5410 


2 


71C 


LRFPGQARHVWLAARMQAPHXEHL YKLLVIGDLGVGKTS IIKRY 

MTE n REAMGAFIVFDVTRPATFEAVAKWKNDLDSKLSLPNG 
XPVSWLLANKCDOGKDVLMNNGLKMDQFCKEIIGFVGWFETSAK 
ENINIDEASRCLVKHILANECDLMESIEPDWKPHLTSTKVASC 
SG\CAKIIiVGTFAGVW 


5411 


1302 


289 


TGPAAAGRRKALGSFGKPSPVTGLRAARRRRTRPSAPAAPSVGC 
GKRRESDAGAGGERASVRTGSGRRGGRTMAGDSEQTLQNHQQPN 
GGEPFL IGVSGGTASGKS S VCAKI VQLLGQNE VDYRQKQ WI LS 
QDSFYRVLTSEQKAKALKGQFNFDHPDAFDNELILKTLKEITEG 
KTVQIPVYDFVSHSRKEETVTVYPADWLFEGILAFYSQER/IR 
DLFQMKLFVDTDADTRLSRRVLKDISERGRDLEQILSSSTLRFV 
KPA\FEEFCLPPK\KYAnvIIPR\GADN\RVPINLIVQHIQ\DI 
LNGGPS\NRQTNGCLNGYTPSRKRQASESSSRPH 


5412 


3180 


313 


QGISNFFHKEANFWFEVSGYLISPLRSPFVDPALEWSLMASPWN 
KMEGESSRFEIHTPVSDKKKKKCSIHKERPQKHSHEIFRDSSLV 
KEQSQITRRKKRKKDFQHLISSPLKKSRICDETANATSTLKKRK 
KRRYSALEVDEEAGVTWLVDKENINNTPKHFRKDVDWCVDMS 
IEQKLPRK\PKTDKFQVLAKSH\AHKSEALHSKVREKKWKKHQR 
KAASWESQ3A\RDTLPQSJiFP'I'QEESWLSVGPGGEITELP\ASA 
H;:-.Jr;:Ji:KKKKKSSNREYET\LAMPEGSQAGREAGTDMQESQPTV 
GLDDETPQLLGPTHKKKSKKKKKKKSNHQEFESLAMPEGSQVGS 








SKKRKLTSVKRARVSGDDFSVPSKNSESTLFDSVEGDGAMMEEG 
VKSRPRQKKTQACLASKHVQEAPRLEPAKEEHNVETAEDSEIRY 
LSADSGDADDSDADLGSAVKQLQEFI ENI KDRATST I KRMYRDD 
LERFKEFKAQGVAIKFGKFSVKENKQLEKNVEDFLALTGIESAD 
KLLYTDRYPEEKSVITNLKRRYSFRLHIG\RNIARPWKLIYYRA 
KKMFDVNNYKGRYSEGDTEKLKMYHSLLGNDWKTIGEMVARRSL 
SVALKFSQISSQKNRGAWSKSETRKLIKAVEEVILKKMSPQELK 
EVDSKLQENPESCLSIVREKLYKGISWVEVEAKVQTRNWMQCKS 
KWTEI LTKRMTNGRR I Y YGMNALRAKVS L I ERL YE IWVEDTNE I 
DWEDIiASAIGDVEPSYVQTKFSRLKAVYVPFWQKKTFPEIIDYL 
YETTLPLLKEKLEKMMEKKGTKIQTPAAPKQVFPFRDIFYYEDD 
SEGGGHRKRKRRPRRHAWFTPVIPVLWEAKAGWII 


5413 


3 753 


1304 


EFrAGVAPRRAMANVSKKVSWSGRDRDDEEAAPLLRRTARPGGG 
TPLLNGAGPGAARQSPRSALFRVGHMSSVKLDDELLEP\DMDPP 
HPFPKEIPHKEKLLSLKYESLDYDNSENQLFLEEERRINHTAFR 
TVEIKRWVICALIGILTGLVACFIDIWENLAGLKYRVIKGNID 
KFTEKGC-LSFSLLLWATLNAAFVLVGSVIVAFIEPVAAGSGIPQ 
IKCFLNGVKIPHWRLKTLVIKVSGVILSWGGLAVGKEGPMIH 
SGSVIAAGISQGRSTSLKRDFKIFEYLRRDTEKRDFVSAGAAAG 
VSAAFGAPVGGVLFSLEEGASFWNQFLTWRIFFASM1STFTLNF 
VLSIYHGNMWDLSaPGLINFGRFDSEKMAYTIHEIPVFIAMGW 
GGVLGAVFNALNYWLTMFRIRYIHRPCLQVIEAVLVAAVTATVA 
FVLIYSSRDCQPLQGGSMSYPLQLFCADGEYNSMAAAFFNTPEK | 
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Predicted 

beginning 

nucleotide 

location 

corresponding 



residue of 



corresponding 



Anir.c acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *aStop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 



SWSLFHDPPGSYNPLTLGLFTLVYFFLACWTYGLTVSAGVFIP 
SLLIGAAKGRLFGISLSYLTGAAIWADPGKYALMGAAAQLGGIV 
RMTLSLTVIMMEATSNVTYGFPIMLVLMTAKIVGDVFIEGLYDM 
HIQLQSVFFLHWEAPVTSHSLTAREVMSTPVTCLRRREKVGVIV 
DVLSDTASNHNG FPWEHADDTQPARLQGLI LRSQL IVLLKHKV 
FVERSNLGLVQRRLRLKDFRDAYPRFPPIQSIHVSQDERECTMD 
LS E FMNPS P YT VP QE AS L PRV FKLFRALGLRHLVWDNRNQ WG 
LVTRKDLARYRLGKRGLEELSLAQT 



GVASAWDRALFSPLLSPTSRVFRTSPPRCVSTETGRRDRARVPS 
QWCSVLQGKLPVSGRTSLACVRSILLSPASSPRKVGIVGGTGAR 
AGAAPRDHGRVRHRRPSSARRMTRTTGQCLAPRGCQGPRGTRSP 
RSPRSRTRRGCSASPACLP/CRSALIVAVLCYINLLNYMDRFTV 
AGVLPD I EOF FN I GD S S SGLI QT VF I S S YM VLAP VFGYLGDR YN 
RKYLMCGGIAFWSLVTLGSSFIPGEHFWLLLLTRGLVGVGEASY 
STIAPTLIADLFVADQRSRMLSIFYFAIPVGSGLGYIAGSKVKD 
MAGDWHWALRVTPGLGWAVLLLFLWREPPRGAVERHSDLPPL 
NPTSWWADLRALARNPSFVL3SLGFTAVAFVTGSLALWAPAFLL 
RSRWLGETPPCLP3DSCSSSDSLIFGLITCLTGVLGVGLGVEI 
3 RRLRHSNPRAD PLVCATGLLGS AP FLFLS LACARGS I VATYI F 
I FIGETLLSMNWAI VAD I LLYWI PTRRSTAEAFQ IVLSHLLGD 
AGS P Y^ IGLI .SDRLRRNWPPSFLSEFRALOFSLM^CAFVGAJ.RR 
AAFLGTAHLH 



rPi^ i L * NQEQAl'IFEEVQKLRPRNEQRENEL 

IISFLRCLFE3KQKEHIIIIG2MKQTSQMAAENIGSELPPSATRF 
RLDMLKNKAKRSLTESLESILSRGNKARGLQEHSISVDLDSSLS 
STLSNTSKEPSVCEKEALPISESSFKLLGSSEDLSSDSESHLPE 
EPAPLSPOQAFRRRANTLSHFPIECQEPPQPARGSPGVSQRKLM 
RYHSVSTETPHERKDFESKANHLGDSGGTPVKTRRHSWRQQIFL 
RVATPQKACDSSSRYEDYSELGELPPRSPLEPVCEDGPFGPPPE 
EKKRTSRELRELWQKAILQQILLLRMEKENQKLQASENDLLNKR 
LKLDYEEITPCLKEVTTWEKMLSTPGRSKIKFDMEKMHSAVGQ 
GVP\RHHRGEIWKFLAEQ FHLKHQFPSKQQPKDVPYKELLKQLT 
SQQHAILIOLGRTFPTHPYFSAQLGAGQLSLYNILKAYSLLDQE 
VGYCQGLSFVAGILLLHMSEEEAFKMLKFLMFDMGLRKQYRPDM 
IILQIQMYQLSRLLHDYHRDLYNHLEEH3IGPSLYAAPWFLTMF 
ASQFPLGFVARVFDMIFLQGTEVIFKVALSLLGSHKPLILQHEN 
LETIVDFIKSTLPWLGLVQMEKTINQVFEMDIAKQLQAYEVEYH 
VLQEELIDSSPLEDNQRMDKLEKTNSSLRKQNLDLLEQLQVANG 
RIQSLEATIEKLLSSESKLKQAMLTLELERSALLQTVEELRRRS 
AKPSDREPECTQPEPTGD 



KSQLFCFWGGKAGDILSGDQDKEQKDPYFVETPYGYQLDLDFLK 
YVDDIQKGNTIKRLNIQKRRKPSVPCPEPRTTSGQQGIWTSTES 
LSSSNSDDNKQCPNFLIARSQVTSTPISKPPPPLETSLPFLTIP 
ENRQLP P P S PQL P KHNLHVTKTLMETRRRLEQERATMQMTPGE F 
RRPRLASFGGMGTTSSLPSFVGSGNHNPAKHQLQNGYQGNGDYG 
SYAPAAPTTSSMGSSIRHSPLSSGISTPVTNVSPMHLQHIREQM 
AIALKRLKELEEQVRTIPVLQVKISVLQEEKRQLVSQLKNQRAA 
SQINVCGVRKRSYSAGNASQLEQLSRARRSGGELYIDYEEEEME 
TVEQSTQRIKEFRQL\TADMQALEQKIQDSSCEASSELRENGEC 
RSVAVGAEEWMNDIWYHRGSRSCKDAAVGTLVEMRNCGVSVTE 
AMLGVMTEADKEIELQQQTIESLKEKIYRLEVQLRETTHDREMT 
KLKQELQAAGSRKKVDKATiMAQPLVFSKVVEAWQTRDQMVGSH 
MDLVDTCVGTSVETNSVGISCQPECKNKWGPELPMflWWIVKER 
VEMHDRCAGRSVEMCDKSVSVEVSVCETGSNTEESVNDLTLLKT 
NLNLKEVRSIGCGDCSVDVTVCSPKECASRGVNTEAVSQVEAAV 
MAVPRTADQDTSTDLEQVHQFTNTETATLIESCTNTCLSTLDKQ 
TSTQTVETRTVAVGEGRVKDINSSTKTRSIGVGTLLSGHSGFDR 
PSAVKTKESGVGQININDNYLVGLKMRTIACGPPQLTVGLTASR 
RSVGVGDDPVGESLENPQPQAPLGMMTGLDHYIERIQKLLAEQQ 
TLLAENYSELAEAFGEPHSQMGSLNSQLISTLSSINSVMKSAST 
EELRNPDFQKTSLGKITGSYLGYTCKCGGLQSGSPLSSQTSQPE 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 
amino acid 
residue of 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 


Amine acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I = Isoleucine, K=Lysir.e, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W^Tryptophan, Y=Tyrosine, X=t7nknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








QEVGTSEGKPISSLDAFPTQEGTLSPVNLTDDQIAAGLYACTNN 
SSTLKSIMKKKDGNKDSNGAKKNLQFVGINGGYETTSSDDSSSD 
ESSSSESDDECDVIEYPLEEEEEEEDEDTRGMAEGHHAVNIEGL 
KSARVEDEMQVQECEPEKVEIRERYELSEKMLSACNLLKNTIND 
PKALTSKDMRFCLNTLQHEWFRVSSQKSAIPAWGDYIAAFEAI 
S PDVLRYVINLADGNGNTALHYSVSHSNFE IVKLLLDADVCNVD 
HQNKAGYTPIMLAALAAVEAEKDMRIVEELFGCGDVNAKASQAG 
QTALMIAVSHGRIDMVKGLLACGADVNIQDDEGSTALMCASEHG 
HVEIVKLLLAQPGCNGHLEDNDGSTALSIALEAGHKDIAVLLYA 
HVNFAKAQS PGTPRLGRKTS PGPTHRGS FD 


5417 


27 


4074 


XSQLFCFWGGKAGDILSGDQDKEQKDPYFVETPYGYQLDLDFLK 
YVDDIQKGNTIKRLNIQKRRKPSVPCPEPRTT3GQQGIWTSTES 
LSSSNSDDNKQCPNFLIARSQVTSTPISKPPPPLETSLPFLTIP 
EHRQLPPPSPQLPKHNLHVTKTLMETRRRLEQSRATMQMTPGEF 
RRPRLASFGGMGTTSSLPSFVGSGNHNPAKHQLQNGYQGNGDYG 
S YAPAAPTTS SMGSSIRHSPLSSGI STPVTNVS PMHLQHI REQM 
AIALKRLKELEE2VRTIPVLQVKISVLQEEKRQLVSQLKNQRAA 
SQINVCGVRKRSYSAGNASQLEQLSRARRSGGELYIDYEEEEME 
TVEQSTQRIKEFRQL\TADMQMjEQKIQDSSCEASSELRE^GEC 
RSVAVGAEENMNDIVVYHRGSRSCKDAAVGTLVEMRNCGVSVTE 
AMLGVMTEADKEIELQQQTIESLKEKIYRLEVQLRETTHDREMT 
KLKQELQAAGSRKKVDKATMAQPLVFSKWEAWQTRDQMVGSH 
MDLVDTCVGTSVETNSVGISCQPECKNKWGPELPMNWWIVKER 
VEMHDRCAGRSVEMCDKSVSVEVSVCETCSNTEESVNDLTLLKT 
NLNLKEVRSIGCGDCSVDVTVCSPKECASRGVNTEAVSQV3AAV 
MAVPRTADQDTSTDLEQVHQFTNTETATLIESCTNTCLSTLDKQ 
TSTQTVETRTVAVGEGRVKDINSSTKTRSIGVGTLLSGHSGFDR 
PSAVKTKESGVGQININDNYLVGLKMRTIACGPPQLTVGLTASR 

TLLAENYSELAEAFGEPHSQMGSLNSQLISTLSSINSVMKSAST 
EELRNPDFQKTSLGKITGSYLGYTCKCGGLQSGSPLSSQTSQPE 
QEVGTSEGKPISSLDAFPTQEGTLSPVNLTDDQIAAGLYACTWN 
ESTLKSIMKKKDGNKDSNGAKKNLQFVGINGGYETTSSDDSSSD 
ESSSSE3DDECDVIEYPLEEEEEEEDEDTRGMAEGHHAVNIEGL 
KSARVEDEMQVQECEPEKVEIRERYELSEKMLSACNLLKNTIND 
PKALTSKDMRFCLNTLQHEWFRVSSQXSAIPAMVGDYIAAFEAI 
S PDVLRYV INLADGNGNTALHYS VSHSNFE IVKLLLDADVCNVD 
HQNKAGYTPIMLAALAAVEAEKDMRIVEELFGCGDVNAKASQAG 
QTALMLAVSHGRIDMVKGLLACGADVNIQDDEGSTALMCASEHG 
HVEIVKLLLAQPGCNGHLEDNDGSTALSIALEAGHKDIAVLLYA 
HVNFAKAQSPGTPRLGRKTSPGPTHRGSFD 


5418 


24 


1133 


SVPRAGGDKETGAAELYDQALLGILQHVGNVQDFLRVLFGFLYR" 
KTDFYRLLRHPSDRMGFPPGAAQALVLQVFKTFDHMARQDDEKR 
l.'.tL: /KIRKKEEEEAKTVSAAAAEKEPVPVPVQEIEIDSTTEL 
DGHQ E VE KVQ P PGP VKEMAHGSQE AEAPGAVAGAAE VPR \ E P P I 
LPRIQEQFQKNPDSYNGAVRENYTWSQDYTDLEVRVPVPKHVVK 

3 •." i /ai ss :,';rvamleengervlme3Klthkintesslwsl 

EPGKCVLVNLSKVGEYWWNAILEGEEPIDIDKINKERSMATVDE 
3EQAVLDRLTFDYHQKLQGKPQSHELKVHEMLKKGWDAEGSPFR 
GORFDPAMFNISPGAVQF 


5419 


1395 


259 


GTHPLDPDLVSRTSVQGPLMTMACPGMSDTEESPFLGPRAAEEG 
SESEACEAFGRRKSEEEGRRSDTSGFGRSRKHKVNWKHPEPADA 
KDPASLPQC/LGP/DCVRPAQPSSKYCSDDCGMKLAANRIYEIL 
21 H ^WQ2SPCIAEEHGKKLLERIRREQQSARTRLQEMERRFH 
3LEAI I LRAKQQAVREDEESNEGDSDDTDLQI FCVS CGHP INPR 
VALRHMERCYAKYESQTSFGSMYPTRIEGATRLFCDVYNPQSKT 
YCKRLQVLCPEHSRDPKVPADEVCGCPLVRDVFELTGDFCRLPK 
RQCNRHYCWEKLRRAEVDLERVRVWYKLDELFEQERNVRTAMTN 
RA3LLALMLHQTIQHDPLTTDLRSSADR 


5420 


117 


1733 


NEAGGACPb'KGGASGRLYLSPRLPRVSVAGCEERPLGWVWVLGG 
GGFLPARPPRAQRHLGFSHAEQSMEAPDYEVLSVREQLFHERIR 
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Predicted 
beginning 
icleotide 

corresponding 
first 
.no acid 



Predicted end 
nucleotide 

corresponding 
) first 
lino acid 



Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
~ienylalanine, G=Glycine, 
K=Lysir 



H=Histidine, I=Iso: 

P=Proline, Q=Glutai 
S=Serine, T=Threon; 
W=Tryptophan, Y=Ty: 
Codon, /=possible 1 
\=possible nucleotide insei 



.e , N=Asparagine , 
., R=Arginine, 

V=Valine, 
ne, X=Unknown, *=: 
.cleotide deletion, 



STLLFATLYILCHIFLTRFKKPAEFTT\GMMKMPPSTRL/ 
LLELCTFTLAIALGAVLLLPFS I ISN3VLLSLPRNYYIQWLNGS 
LIHGLWMLVFLFSNLSLIFLMPFAYFFTESEGFAGSRKGVLGRV 
YETWMLMLLTLLVLGMVWASAIVDKNKANRESLYDFWEYYLP 
YLYSCI3FLGVLLLLVCTPLGLARMFSVTGKLLVKPRLLEDLEE 
QLYCSAFEEAALTRRICNPTSCWLPLDMELLHRQVLALQTQRVL 
LEKRRKASAWQRNLGYPIAMLCLLVLTGLSVLIVAIHILELLID 
EAAMPRGMQGTSLGQVSFSKLGSFGAVIQWLIFYLMVSSWGF 
YSSPLFRSLRPRNHDTAMTQIIGNCVCLLVLSSALPVFSRTLGL 
TRFDLLGDFGRFNWLGNFYIVFLYNAAFAGLTTLCLVKTFTAAV 
RAELIRAFGERE 



NEAGGACPFKGGASGRLYLSPRLPRVSVAGCEERPLGWVWVLGG 
GGFLPARPPRAQRHLGFSHAEQSMEAPDYEVLSVREQLFHERIR 
ECTIST:,T,FATI.YTr,CHTFLTRFKKPAEFTT\GMMKMPPSTRL/ 
LLELCTFTLAIALGAVLLLPFSIISNEVLLSLPRNYYIQWLNGS 
LIHGLWNLVFLFSNLSLIFLMPFAYFFTESEGFAGSRKGVLGRV 
YETWMLMLIiTLLVLGtTVWVASAIVDKNKANRESLYDFWEYYLP 
YLYSCISFLGVLliLLVCTPLGLARMFSVTGKliLVKPRLLEDLiEE 
QLYCSAFEEAALTRRICNPTSCWLPLDMELLHRQVI1ALQTQRVI1 
tiEKRRKASAWQRNLGYPLAMLCLLVLTGLSVLIVAIHILELLID 
EAAMPRGMQGTSLGQVSFSKLGSFGAVIQWLIFYLMVSSVVGF 
YSSPLFRSLRPRWIDTAMTQIIGNCVCLLVLSSALPVFSRTLGL 
TRFDLLGDFGRFNWLGNFYIVFLYNAAFAGLTTLCLVKTFTAAV 
RAELIRAFGERE 



3CGES LPTWLAGASRPGI GRKGGAWGGRGGSS PAQVLLS PGPVF 
KAGCNWWHLSRDQAGVQRCDLGSSQPPPLGFKRFSCLSLPSSWD 
YRSTVLCVSKMEADLSGFNIDAPRWDORTFLGRVKHFLNITDPR 
TVFVSERELDWAKVMVEKSRMGWPPGTQVEQLLYAKKLYDSAF 
HPDTGEMNVIGRMSFQLPGGMIITGFMLQFYRTMPAVIFWQWV 
NQSFNALVNYTNRNAASPTSVRQMALSYFTATTTAVATAVGMNM 
LTKKAPPLVGRWVPFAAVAAANCVNIPMMRQQEL-KGICVKDRN 
3NEIGHSRRAAAIGITQWISRITMSAPGMILLPVIMERLEKLH 
FMQKVKVL/SAPLQVMDSGCFLIFMVPVACGLFPQKCELPVSYIi 
EPKLQDTI KAKYGELEPYVYFNKGL 



GVSMALGEEKAEAEASEDTKAQSYGRGSCRERELDIPGPMSGEQ 
PPRLEAEGGLISPVWGAEGIPAPTCWIGTDPGGPSRAHQPQASD 
ANREPVAERSEPALSGLPPATMGSGDLLLSGESQVEKTKLSSSE 
EFPQTLSLPRTTICSGHDADTEDDPSLADLPQALDLSQQPHSSG 
LSCLSQWKSVLSPGSAAQPSSCSISASSTGSSLQGHQERAEPRG 



GRRRLSFQAEYWACVLPDSLPPSPBRHSPLWNPNKEYEDLLDYT 
YPLRPGPQIjPKHLDSRVPADPVLQDSGVDLDSFSVSPASTLKSP 
TNVSPNCPPAEATALPFSGPREPSLKQWPSRVPQKQGGMGLASW 
SQLASTPRAPGSRDARWERREPALRGAKDRIiTIGKHLDMGSPQL 



GGEQGKESLVQC\VKTFC\COLEELICWLYNV\ADVTDHGTPAR 
SNLTSLK\SSLQLYRQFKKDIDEHQSTiTESVLQKGEILLQCLLE 
NTPVLEDVLGRIAKQSGELESHADRLYDSILASLDMLAGCTLIP 
DKKPMAAMEHPCEGV 



iGSCRERELDIPGPMSGEQ 
PPRLEAEGGIjISFWGAEGIPAPTCWIGTDPGGPSRAHQPQASD 
ANREPVAERSEPALSGLPPATMGSGDLLLSGESQVEKTKLSSSE 
EFPQTLSLPRTTICSGHDADTEDDPSLADLPQALDLSQQPHSSG 



GSIAKVSSSLEPWPQEPSSWGLGPRPQWSPQPVFSGGDASGL 
GRRRLSFQAEYWACVLPDSLPPSPDRHSPLWNPNKEYEDLLDYT 
YPLRPGPQLPKHLDSRVPADPVLQDSGVDLDSFSVSPA3TLKSP 
TNVSPNCPPASATALPFSGPREPSLKQWPSRVPQKQGGMGLASW 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

amino acid 

amino acid 
sequence 


Predicted end 
nucleotide 

corresponding 
to first 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E« 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








SQIASTPRAPGSREARWERREPALRGAKDRLTIGKHDDMGSPQL 
RTRDRGWPSPRPEREKRTSQSARRPTCTESRWXSEEEVESDDEY 
LALPARLTQVSSLVSYLGSISTLVTLPTGDIKGQSPLEVSDSDG 
PASFPSSSSQSQLPPGAALQGSGDPEGQNPCFLRSFVRAHDSAG 
EGS-jGSSQALGVSSGLLKTRPSLPARLDRWPFSDPDVEGQIjPRK 

ggeqgkeslvqc\vktfc\cqleelicwlynv\advtdhgtpar 
snltslk\sslqlyrqfkkdidehqsltesvlqkgeillqclle 

NTPVLEDVLGRIAKQSGELESHADRLYDS1LASLDMLAGCTLIP 
DKKPMAAMEHPCEGV 


3425 


108S 


115 


GFCPSPSLGHQPPRVLHPTMSMAVETFGFFMATVGLLMLGVTLP 
NSYWRVSTVHGOTITTNTIFENLWFSCATDSLGVYNCWEFPSML 
ALSGYIQACRALMITA1LLGFLGLLLGIAGLRCTNIGGLELSRK 
AKIAATAGAPH\ILPGICGMVAI\SWYAFNITR\DFSDPLYPGT 
KYELGPALYLGWSASLISIL3GLCLCSACCCGSDEDPAASARRP 
YQAPVSVMPVATSDQEGDSSFGKYGRNALRVAALGRGPRCjPTA 
PKKRGPGRGPFPYSNLRGRPRPVPVAPPRPRPRVLHSHGPSQAK 
NCSWEVAYLPSEAGSLIF 


5426 




3435 


ATSSQSLGRADPPRGGTMERSPGEGPSPSPMDQPSAPSDPTDQP 
PAAHAKPEPGSGGQPAGPGAAGEALAVLTSFGRRLLVLIPVYLA 
GAVGLSVGFVLFGLALYLGWRRVRDEKERSLRAARQLLDDEEQL 
TAKTLYMSHRELPAWVSFPDVEKAEWLNKIVAQVWPFLGQYMEK 
LIiAETVAPAVRGSNPHLQTFTFTRVELGEKPLRI IGVKVHPGQR 
KEQ I LL-DLNI SYVGDVQI DVE VKKYFC KAGVKGMQ LHGVLRVI L 
EPLIGDLPFVGAVSMFFIRRPTLDINWTGMTNLLDIPGLSSLSD 
TMIMDSIAAFLVLPNRLLVPLVPDLQDVAQLRSPLPRG1IRIHL 
LAARGLSSKDKYVKGLIEGKSDPYALVRLGTQTFCSRVIDEELN 
PQWGET YEVMVHEVPGQE I EVEVFDKDPDKDDFLGRMKLDVGKV 
LQASVLDDWFPLQGGQGQVHLRLEWLSLLSDAEKLEQVLQWWWG 
VSSRPDPPSAAILWYLDRAQDLPMVTSELYPPQLKKGNKEPNP 
MVQLSIQDVTQESKAVYSTNCPVWEEAFRFFLQDPQSQELDVQV 
KDDSRALTLGALTLPLARLLTAPEItlLDQWFQLSSSGPNSRLYM 
KLVMRILYLBSSEICFPTVPGCPGAWDVDSENPQRGSSVDAPPR 
PCHTTPDSQFGTEHVLRIHVLEAQDLIAKDRFLGGLVKGKSDPY 
VKLKLAGRSFRSHWREDLNPRWNEVFEVIVTSVPGQELEVEVF 
3KDLDKDDFLGRCKVRLTTVLNSGFLDEWLTLEDVPSGRLHLRL 
ERLTPRPTAAELEEVLQVNS L IQTQKSAELAAALLS I YMERAED 
LPLRKGTKHLSPYATLTVGDSSHKTKTISQTSAPVWDESASFLI 
RKPHTESLELQVRGEGTGVLGSLSLPLSELLVADQLCLDRWFTL 
SSGQGQVLLRAQLGILVSQHSGVEAHSHSYSHSSSSLSEEPELS 
GGPPHITSSAPEV\RQRLTHVDSPLEAPAGPLGQVKLTLWYYSE 
ERKLVSIVHGCRSLRQNGRDPPDPYVSLLLLPDKlirRGTKRRTSQ 
KKRTLSPEFNERFEWELPLDEAQRRKLDVSVKSNSSFMSREREL 
LGKVQLDLAETDLS QGVARWYDLMDNKDKGSS 


5427 


42 


3435 


ATSSQSLGRADPPRGGTMERSPGEGPSPSPMDQPSAPSDPTDQP 
PAAHAKPDPGSGGQPAGPGAAGEALAVLTSFGRRLLVLIPVYLA 
GAVGLSVGFVLFGLALYLGWRRVRDEKERSLRAARQLLDDEEQL 
TAKTLYMSHRELPAWVSFPDVEKAEWLNKIVAQVWPFLGQYMEK 
LLAETVAPAVRGSNPHLQTFTFTRVELGEKPLRIIGVKVHPGQR 
KEQILLDLNISYVGDVQIDVEVKKYFCKAGVKGMQLHGVLRVIL 
EPr.IGDLPFVGAVSMFFIRRPTLDIKWTGMTNLLDIPGLSSLSD 
TMIMDSIAAFLVLPNRLLVPLVPDLQDVAQLRSPLPRGIIRIHL 
LAARGLSSKDKYVKGLIEGKSDPYALVRLGTQTFCSRVIDEELN 
PQWGETYEVMVHEVPGQEIEVEVFDKDPDKDDFLGRMKLDVGKV 
LQASVLDDWFPLQGGQGQVHLRLEWLSLLSDAEKLEQVLQWNWG 
VSSRPDPP SAAI LWYLDRAQDLPMVTS EL YPPQLKKGNKE PNP 
MVQLSIQDVTQESKAVYSTNCPVWEEAFRFFLQDPQSQELDVQV 
KDDSRALTLGALTLPLARLLTAPELILDQWFQLSSSGPNSRLYM 
KLVMRILYLDSSEICFPTVPGCPGAWDVDSENPQRGSSVDAPPR 
PCHTTPDSQFGTEHVLRIHVLEAQDLIAKDRFLGGLVKGKSDPY 
VKLKLAGRSFRSHWREDLNPRWNEVFEVIVTSVPGQELEVEVF 
DKDLDKDDFLGRCKVRLTTVLNSGFLDEWLTLEDVPSGRLHLRL | 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 

residue of 
amino acid 
sequence 


Predicted er.d 
nucleotide 
location 
corresponding 

residue of 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E- 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methior_ine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








ERLTPRPTAAELEEVLQVNSLIQTQKSAELAAALLSIYMERAED" 
LPLRKGTKHLSPYATLTVGDSSHKTKTISQTSAPVWDESASFLI 
RKPHTESLELQVRGEGTGVLGSLSLPLSELLVADQLCLDRWFTL 
SSGQGQVLLRAQLC-ILVSQHSGVEAHSHSYSHSSSSLSEEPELS 
GGPPHITSSAPEV\RQRLTHVDSPLEAFAGPLGQVKLTLWYYSE 
ERKLVSIVHGCRSLRQNGRDPPDPYVSLLLLPDKNRGTKRRTSQ 
KKRTLSPEFNERFEWELPLDEAQRRKLDVSVKSNSSFMSR3REL 
LGKVQLDLAETDLSQGVARWYDLMDNKDKGSS 


5428 




1839 


SSRSERLSACAIAPPWLVSSRPARPAQLQRPGKMVEDGAEELED 
LVHFSVSELPSRGYGVMEEIRRQGKLCDVTLKIGDHKFSAHRIV 

NLAIDQ2NVQSLLMGASFLQLQSIKDACCTFLEERLHPKNCLGV 
RQFAETMMCAVLYDAANSFIHQHFVEVSMSEEFIALPLEDVLEL 
VSRDELNVKSEEQVFEAA1AWVRYDREQRGTFL\RNLQSNIRLL 
FCRPQFLSDRVQQDDLVRCCHKCRDLVDEAKDYLLMPERRPHLP 

CRPMTTARSRVGVAWNGLLYAXGGYDGQLRLSTVQAYNTETDT 
WTRVGSMNSKRSAMGTWLDGQIYVCGGYDGNSSLSSVETYSPE 
TDKWT\ATTSMSSNRSAA\GVTVFEGRI YVSGGHDGLQIFSSVEH 
YNHHTATWHPAAGMLNKRCRHGAASLGSKMFVCGGYDGSGFLSI 
AEMYSSV\ADQWCLIVPM\HTRR\SRVSLGGPAVGRLYAVWGVT 
TGQSNL\SSVGDVLTPETDCWTFM\APMACHEGGVGVGCIPLLT 
I 


5429 




202 


RHEDALGSEGCLWP3ESTVSGNGIPEPQVYAPPRPTDRLAVPPF 

LRDPEQQbEIiNRESVRAPPNRTIFDSDLMDSARLGGPCPPSSNS 
GISATCYGSGGRMEGPPP\TYSEVIGHYPGSKFQHQQSSGPPSL 
LEGTRLHHTHIAPLESAAIWSKEKDKQKGHPL 


5430 


441 


1507 


QKRRKRRRKKIMKTIQPKMHNSISWAIFTGLAALCLFQGVPVRS 
GDATFPKAMDNVTVRQGESATLRCTIDNRVTRVAWLNRSTILYA 
GNDKWCLDPRWLLSNTQTQYSIEIQNV3VYDEGPYTCSVQTDN 
HPKTSRVHLIVQVSPKIVEISSDISINEGNN1SLTCIATGRPEP 
TVTWRHI S PKAVGFVSEDE YLEI QG1 TREQSGDYE CS ASNDV\A 
APV\VRRVKVTVNYPPYISEAKGTGVPVGQKGTLQCEASAVPSA 
EFQWYKDDKRLI/EGKKGVKVENRPFLSKLIFFNVSEHDYGNYT 
CVASNKLGHTNASIMLFGPGAVSEVSNGTSRRAGCVWLLPLLVL 
HLLLKF 


5431 


2 


1312 


AAAAPGSRRRRPLPDRPHMAHGYEAPPPPAPRSPAWRARSKPV\ 
LPGITINP\TIAEGPSP\TSEGASEANLVDLQKKLEELELDEQQ 
KKRLEAFLTQKAKVGELKDDDFERISELGAGNGGWTKVQHRPS 
GL.IMARKLIHLEIKPAIRNQIIRELQVLHECNSPYIVGFYGAFY 
SDGEISICMEHMDGGSLDQVLKEAKRIPEEILGKVSIAVLRGLA 
YLREKHQIMHRDVKPSNILVNSRGEIKLCDFGVSGQL1DSMANS 
F : rr c s MAPERLQGTHYSVQSDIWSMGLSLVELAVGRYPIPPP 
DAKELEAIFGRPWDGEEGEPHSISPRPRPPGRPVSGHGMDSRP 
' 1 ^ELIDYIVNEPPPKLPNGVFTPDFQEFVNKCLIKNPAERA 
DLKMLTNHTFIKRSEVEEVDFAGWLCKTLRLNQPGTPTRTAV 


5432 


2 


1312 


AAAAPG£RRRRPLPDRPHMAHGYEAPPPPAPRSPAWRARSKPV\ 
LPGITIKP\TIAEGPSP\TSEGASEANLVDLQKKLEELELDEQQ 
KKRLEAFLTQKAKVGELKDDDFERISELGAGITGGWTKVQHRPS 
GLIMARKLIHLEIKPAIRNQIIREIiQVLHECNSPYIVGFYGAFY 
S DGEIS I CMEHMDGGSLDQVLKEAKRIPEE ILGKVS IAVLRGLA 
Yl IE -CIMHRDVKPSNILVNSRGElKLCDFGVSGQLIDSMANS 
FVGTRSYMAPERLQGTHYSVQSDIWSMGLSLVELAVGRYPIPPP 
DAKELEAIFGRPWDGEEGEPHSISPRPRPPGRPVSGHGMDSRP 
AM2 3LLDYIVNEPPPKLPNGVFTPDFQEFVWKCLIKNPAERA 
DLKMr.TNHTFIXRSHVEEVDFAGWLCKTLRLNQPGTPTRTAV 


5433 


360 


1885 


■ I WGFEDPLE-ILCSWRARACPCTWPHC/ CTGLLECLGFAGV 
LFGWPSLVFVFKNEDYFKDLCGPDAGPIGNATGQADCKAQDERF 
SLIFTLGSFMNNF.MTFPTGYIFDRFKTTVARIilAIFFYTTATLI 
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SEQ 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 

residue of 

sequence 


Predicted end 
nucleot ide 

corresponding 
to first 

amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine , V==Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








IAFTSAGSAVLLFLAMPMLTIGGILFLITNLQIGNLFGQHRSTI 
i r FLI IKLLYEKGISLR/VLLHLHLCLQYIjAC 
STHFPPDAPGAHP1PTAPQLQLWPVPWEWHHKGREG/QQLSMKT 
GSYSQRSSFQRRKRPQGQGRSRNSAPSGATL/CSRRFAWHLVWL 
SVIQLWHYLFIGTLNSLLTNMAGGDMARVSTYTNAFAFTQFGVL 
CAPWNGLLMDRLKQKYQKEARKTGSSTLAVALCSTVPSLALTSL 
LCLGFALCASVPILPLQYLTFILQVISRSFLYGSNAAFIiTLAFP 
SEHFGKLFGLVMALSAWSLLQFPIFTLIKGSLQNDPFYVNVMF 
MIAILLTFFHPFLVYRECRTWKESPSAIA 


5434 


66 


652 


RYAALIISLIQHKLLWRNQHCSRCVIMSPAQSAGLNWLF/GSGK' 
HGPFLGCSQYPACDYVRPLKSSADGHIVKVLEGQVCPACGANLV 
LRQGRFGMFIGCINYPECEHTBLIDKPDETAITCPQCRTGHLVQ 
RRSRYGKTFHSCDRYPECQFAINFKPIAGECPECHYPLLIEKKT 
AQGVKHFCASKQCC-KPVSAE 


5435 


1 /04 


1597 


PGDSSQRLAEMSNAKERKHAKKMRNQPTNVTLSSGFVADRGVKH 
HSGGEKPFQAQKQEPHPGTSRQRQTRVNPHSLPDPEVNEQSSSK 
GMFRKKGGWKAGPEGTSQE I PKY I TASTFAQARAAE ISAMLKAV 
TQKSSNSLVFQTLPRHMRRRAMSHNVKRLPRRLQEIAQKEAEKA 
VHQKKEHSKNKCHKARRCHMNRTLEFNRRQKKNIWLETHIWHAK 
RFHMVKKWGYCLGERPTVKSHRACYRAMTNRCLLQDDSYYCCLE 
LKGKEEEILKALSGMCNIDTGLTFAAVHCLSGKRQGSLVLYRVN 
KYPREMLGPVTFIWKSQRTPGDPSESRQLWIWLHPTLKQDILEE 
IKAACQCVEPIKSAVCIADPLPTPSQEKSQTELPDEKIGKKRKR 
KDDGENAKPIKKIIGDGTRDPCLPYSWISPTTGIIISDLTMEMN 
RFRL IGPLSHS I LTEAI KAAS VHTVGEDTEETPHRWWI ETCKKP 
DSVSLHCRQEAIFELLGGITSPAEIPAGTILGLTVGDPRINLPQ 
KKSKALPNPEKCQDNEKVRQLLLEGVPVECTHSFIWNQDICKSV 
TENKISDQDLNRMRSELLVPGSQLILGPHESKIPILLIQQPGKV 
TGEDRLGWGSGWDVLLPKGWGMAFWIPFIYRGVRVGGLKESAVH 
SQYKRSPNVPGDFPDCPAGMLFAEEQAKNLLEKYKRRPPAKRPN 
YVKLGTLAPFCCPWEQLTQDWESRVQAYEEPSVASSPNGKESDL 
RRSEVPCAPMPKKTHQPSDEVGTSIEHPREAEEVMDAGCQESAG 
PERITDQEASENHVAATGSHLCVLRSRKLLKQLSAWCGPSSEDS 
RGGRRAPGRGQQGLTREACLSILGHFPRALVWVSLSLLSKGSPE 
PHTMICVPAKEDFLQLHEDWHYCGPQESKHSDPFRSKILKQKEK 
KKREKRQKP\GRASSDGPAGEEPVAGQEALTLGLWSGPLPRVTL 
HCSRTLLGFVTQGDFSMAVGCGEALGFVSLTGLLDMLSSQPAAQ 
RGLVLLRPPASLQYRFARIAIEV 


5436 


1781 


635 


ASDSIPWSEARTTRKLAQRGCQWSLPERMPLWFCGLPYSGKSR 
KAEELRVALAAEGRAVYWDDAAVLGAEDPAVYGDSAREKALRG 
ALRASVSRRLSRHDWILDSLNYIKGFRYELY\CLARAARTPLC 
LVYCVRPGGPIAGPQVAGANENPGRNVSVSWRPRAEEDGRAQAA 
GSSVLRELHTADSWNGSAQADVPKELEREESGAAESPALVTPD 
SEKSAKHGSGAFYSPELLEALTLRFEAPDSRNRWDRPLFTLVGli 
EEPLPLAGIRSALF3NRAPPPHQSTQSQPLASGSFLHQLDQVTS 
QVLAGLMEAQKSAVPGDLLTLPGTTEHLRFTRPLTMAELSRLRR 
QFISYTKMHPHNENLPQLANMFLQYLSQSLH 


5437 


739 




:QEA 3E FGG E LHTPAMFLRRLGGWLPRPWGRRKPMRPDPPYPE 

PSKEEPQVEQLGSKRMDSLKWDQPISSTQESGRLEAGGASPKLR 
WDHVDSGGTRRPGVSPEGGlAGVPGPGAPLEKPGRREKLLGWLR 
GEPGAPSRYLGGPEECLQISTNLTLHLLELIASALLALCSRPLR 
AAIJDTLGLRGPLGLWLHGLLSFLAALHGLHAVLSLLTAHPLHFA 
CLFGLLQALVLAVSLREPNGDEAATDWESEGLEREGEEQRGDPG 
KGL 


5438 


2443 


1152 


r . RK RRHQPAS QRQRPWS S DS TGDLLARGKGRKEENKGSDRVS 
LAPPSLRRPMMCQSEARQGPELRAAKWnHFPQliALRRRLGQLSC 
MSRPALKLRSWPLTVLYYLLPFGALRPLSRVGWRPVSRVALYKS 
VPTRLLSRAWGRLNQVELPHWLRRPVYSLYIWTFGVNMKEAAVE 
DLHHYRHLSEFFRRKLKPQARPVCGLHSVISPSDGRILNFGQVK 
NCE VEQVKGVTYS LESFLGPRMCTEDLPFP PAA.S CDS FKNQLVT | 
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Predicted 
beginning 

responding 



Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucinc, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotic 



REGNELYHCVIYIAPGDYHCFHSPTDWTVSHRRHFPGSLMSVNP 
GMARWIKELFCHNERWLTGDWKHGFFSLTAVGAT\NWGSIRIY 
FDRDLHTNSPRHSKGSYNDFSFVTHTNREGVPMALRGEHLG/QS 
FNLGSTIVLIFEAPKDFNFQLKTGQKIRFGEALGSL 



TKPRKRRHQPASQRQRPWSSDSTGDLLARGKGRKEENKGSDRVS 
LAPPSLRRPMMCQSEARQGPELRAAKMLHFPQLALRRRLGQLSC 
MSRPALKLRSWPLTVLYYLLPFGALRPLSRVGWRPVSRVALYKS 
VPTRLLSRAWGRLNQVELPHWLRRPVYSLYIWTFGVNMKEAAVE 
DLHHYRNLSEFFRRKLKPQARPVCGLHSVISPSDGRILNFGQVK 
NCEVEQVKGVTYSLESFLGPRMCTEDLPFPPAASCDSFKNQLVT 
REGNELYHCV1YLAPGDYHCFHSPTDWTVSHRRHFPGSLMSVNP 
GMARWIKELFCHNERWLTGDWKHGFFSLTAVGAT\NWGS1RIY 
FDRDLHTNSPRHS KGS YNDFS FVTHTNREGVPMALRGEHLG /QS 
FNLGSTIVLIFEAPKDFMFQLKTGQKIRFGEAIiGSl] 



EPIPVTPDHRLVTMTHIV\QTFSPVNS\GQPPNYEMLKEEQEVA 
MLGAPHNPAPPMSTVIHIRSETSVPDHWWSLFNTLFMNTCCLG 
FIAFAYS VKSRDRKMVGDVTGAQAYASTAKCLN IWALILG I FMT 
ILLIIIPVLWQAQR 



CRDGGKWGFMVS PMKPLE I KTQCSGPRMDPKICPADPAFFS FIN 
NSDLWVANIETGEERRLTFCHQGLSNVLDDPKSAGVATFVIQEE 
JGLKTLRILYEEVDESEVEVIHVPSP 
AI,33RKTDSYRYPRTGSKNPKIALKLAEFQTDSQGKIVSTOEKE 
LVQi'FSSLFFKVSYIARAGWTRDGKYAWAMFTjDRPQQWLQLVLL 
PPALFIPSTEXEEQ\RLASARAVPRNVQPYWYEEVTNVWINVII 
DIFYPFPQSEGEDELCFLRANFCKTGFCHLYKVTAVLXSQGYDW 
SEPFSPGEGEQSLTNAIWVNEETKLVYFQGTKDTPLEHHLYWS 
YEAAGEIVRLTTPGFSHSCSMSQMFDMFVSHYSSVSTPPCVHVY 
KLSGPDDDPLHKQPRFWASMMEAAKIFHFHTRSDVRLYGMIYKP 
HALQPGKKHPTVLFVYGGPQVQLVNNSFKGIKYLRLHTLASLGY 
AVW1DGRGSCQRGLRFEGALKNQMGQVEIEDQVEGLQFVAEKY 
GFIDLSRVAIHGWSYGGFLSLMGLIHKPQVFKVAIAGAPVTVWM 
AYDTGYTERYMDVPENNQHGYEAGSVALHVEKLPNEPNRLLILH 
GFLDENVHFFHTNFLVSQLIRAGKPYQLQVALPPVSPQIYPNER 
HS IRCPESGEHYEVTLLHFLQEYL 



CGQRSRRRSPDMPEAKPAAKKAPKGKDAPKGAPKEAPPKEAPAE " 
APKEAPPEDQSPTAEEPTGVFLKKPDSVSVETGKDAWVAKVNG 
KELPDKPTIKWFKGKWLELGSKSGARFSFKESHNSASNVYTVEL 
HIGKWLGDRGYYRLEVKAKDTCDSCGFNIDVEAPRQDASGQSL 
ESFKRTSEKKSDTAGELDFSGLLKKREWEEEKKKKKKDDDDLG 
1PPE1WELLKGAKKSEYEKIAFQYGITDLRGMLKRLKKAKVEVK 
KSA^TKKLDPAYQVDRGNKIKLMVEISDPDLTLKWFKNGQEIK 
PSSKYVFENVGKKRILTINKCTLADDAAYEVAVKDEKCFTELFV 
KEPPVLIVTPLEDQQVFVGDRVEMAVEVSEEGAQVMWMKDGVEL 
TREDS F KAR YRFKKDGKRHI L I FSD WQEDRGR YQVI TNGGQ CE 
AELIVEEKQLEVLQDIADLTVKASEQAVFKCEVSDEKVTGKWYK 
NGVEVRPSKRITISHVGRFHKLVIDDVRPEDEGDYTFVPDGYAL 
GSLSAKLNFLEIKVEYVPKQ\BPPKIPLGFASGGKTSENAD/IV 
WAGNKLRLDV\ S I TGEAPS PFAT \ VJLKG\ DEVFT TTEGRTR I E 
KRVDCSS F VI ES AQREDEGRYTI KVTNP IGEDVAS I FLQWDVP 
DPPEAVRITSVGEDWAILVWEPPMYDGGKPVTGYLVERKKKGSQ 
RWMKLNFEVFTETTYESTKMIEGILYEMRVFAVNAIGVSQPSMN 
TKPFMPIAPTSEPLHLIVEDVTDTTTTLKWRPPNRIGAGGIDGY 
LVEYCLEGSEEWVPANTEPVERCGFTVKNLPTGARILFRWGVN 
IAGRSEPATLAQPVTIREIAEPPZIRLPRHLRQTYIRKVGEQLN 
LWPFQGKPRPQWWTKGGAPLDTSRVHVRTSDFDTVFFVRQAA 
RSDSGEYELSVQIEHMKDTATIRIRWEKAGPPINVMVKEVWGT 
NALVEWQAPKDDGNSEIMGYFVQKADKKTMEWFNVYERNRHT3C 
TVSDLIVGNEYYFRVYTENICGLSDSPGVSKNTARILKTGITFK 
PFEYKEHDFRMAPKFLTPLIDRVWAGYSAALNCAVRGHPKPKV 
VWMKNKME IREDPKFLI TWYQGVL TLWIRRPS P FDAGTYTCRAV 
NELGEAIiAECKLEVRVPQ 
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SEQ 
ID 
NO: 


Predicted 
beginning 

Location 
corresponding 

sequence 


E 1 1 cted end 
nucleotide 

corresponding 

amino acid 
residue of 
amino acid 


s-- lino ac^d segmenc containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
Vpossible nucleotide insertion) 


5443 


66 


1003 


SHGQLDAGQSSEQHGGNRQPEQSRSRSSSSSSSPRRSRSAAEPA 
MALSMPLNGLKEEDKEPLIELFVKAGSDGESIGNCPFSQRLFMI 
LWLKGWFSVTTVDLKRKPADLQNLAPGTHPPFITFNSEVKTDV 
NKIEEFLEEVLCPPKYLKLSPKHPESNTAGMDIFAKFSAYIKNS 
RPEANEALERGLLKTLQKLDEYLNSPLPDEIDENSMEDIKFSTR 
KFLDGNEMTLADCNLLPKLHIVKVVAKKYRNFDIPKEMTGIWRY 
LTNAYSRDEFTNTCPSDKEVE I \AYSDVAKRLHQVKSRLLKEVS 
FMSSP 


5444 


2 


344 


SGPIGVTGAQMAKWLRDYLSFGGRRPPPQPPTPDYTESDILRAY 
RAQKNLDFEDPY*DSESRLEPDPAGPGDSKNPGDAKYGSPKHRL 
IKVEAADMARAKALLGGPGEELEADTEYLDPFDAQPHPAPPDDG 
YMEPYDAQWVMSELPGRGVQLYDTPYEEQDPETADGPPSGQKPR 
QSRMPQEDERPADEYDQPWEWKKDHISRAFAVQFDSPEWERTPG 
SAKELRRPPPRSPQPAERVDPALPLEKQPWFHGPLNRADAESLL 
SLCKEGSYLVRLSETNPQDCSLSLRSSQGFLHLKFARTRENQW 
LGQHSGPFPSVPELVLHYSSRPLPVQGAEHLALLYPWTQTP*Q 
*PDWGDRRPNGQVATGLPELWGAEAPSAAAHPGLHRERHPEGLP 
RAEKPGLRGPLLGLRE PLGAGPRG P WGLQE PRRCQVWFS QAPAH 
QGGGCGYGQSQGPSGRPRGGAGSRH 


5445 


2364 


486 


ILSRGFI^SVEICIQLPLPASEPVLLLTWAKRRWRETRSRREPT 
TLRAQSVCPWW I * ETRMNRS I PVEVDESEP YPS QLLKP I PEYSP 
EEFSEPPAPNIRNMAPNSI.SAPTMLHNSSGDFSQAHSTLKLANH 
QRPVSRQVTCLRTQVLEDSEDSFCRRHPGLGKAFPSGCSAVSEP 
ASESWGALPAEHQFSFMEKKNQWLVSQLSAASPDTGHDSDKSD 
C.SLPNASADSLGGSQEMVQRPQPHRNRAGLDLPTIDTGYDSQPQ 
DVLGIRQLERPLPLTSVCYPQDLPRPLRSREFPQFEPQRYPACA 
QMLPPNLSPHAPWNYHYHCPGSPDHQVPYGHDYPRAAYQQVTQP 
ALPGQPLPGASVRGLHPVQKVILNYPSPWDQEERPAQRDCSFPG 
LPRHQDQPHHQPPNRAGAPGESLECPAELRPQVPQPPSPAAVPR 
FPSNPPARGTLKTSNLPEELRKVFITYSMDTAMEWKFVNFLLV 
NGFQTAIDIFEDRIRG IDI IKWMERYLRDKTVMI I VAIS PKYKQ 
CVEGAESQLDEDEHGLHTKYIHRMMQIEFIKQGSMNFRFIPVLF 
PNAKKEHVPTWLQNTHVYSWPKNKKNILLRLLREEEYVAPPRGP 
LPTLQWPL 


5445 


972 


161 


SSWSWCTGRMRKTRLWGLLWMLFVSELRAATKLTEEKYELKEGQ 
TLDVKCDYTLEKFASSQKAWQIIRDGEMPKTLACTERPSKWSHP 
VQVGRIILEDYHDHGLLRVRMVMLQVED5GLYQCVIYQPPKEPH 
MLFDRIRLWTKGFSGTPGSNENSTQNVYKIPPTTTKALCPLYT 
TPRTVTQAPPKSTADVSTPDSEINLTNVTDIIRVPVFNI VILLA 
GGFLSKSLVFSVLFAVTLRSFVP*AHEPTRMSSDFQPHPSGSCA 
KGGGRR 


5447 


207 


617 


MTARTLSLMASLVAYDDSDSEAETEHAGSFNATGOQKDTSGVAR 
PPGQDFASGTLDVPKAGAQPTKHGSCEDPGGYRLPLAQLGRSDR 
GSCPSQRLQWPGKEPQVTFPIKEPSCSSLWTSHVPASHMPLAAA 
RFKQVKLSRNFPKSSFHAQSESETVGKNGSSFQKKKCEDCWPY 
TPRRLRQRQALSTETGKGKDVEPCGPPAGRAPAPLYVGPGVSEF 
IQPYLNSHYKETTVPRKVLFHLRGHRGPVNTIQWCPVLSKSHML 
LSTSMDKTFKVWNAVDSGHCLQTYSLHTEAVRAARWAPCGRRIL 
SGGFDFALHLTDLETGTQLFSGRSDFRITTLKFHPKDHNIFLCG 
GFSSEMKAWDIRTGKVMRSYKATTQQTLDILFLREGSEFLSSTD 
ASTRDSADRTIIAWDFRTSAKISNQIFHERFTCPSLALHPREPV 
FLAQTIIGNYLALFSTVWPYRMSRRRRYEGHKVEGYSVGCECSPG 
GDLLVTG5ADGRVLMYSFRTASRACTLQGHTQACVGTTYHPVLP 
SVLATCSMGGDMKIWH*AFHWLSLGEAIGDLAPARGYSGPGRSL 
KSPSPSK3 LLVLLCGRAMFQPATCPWQLPALS K 


5448 


194 


1833 


M AS KVTDA I VW YQKKIGAYDQQ I WE KS VEQRE I KGLRNKP KKTA 
HVKPDLIDVDLVRGSAFAKAKPESPWTSLTTKGIVRWFFPFFF 
RWWLQVTS KVI FFWLL VL YLLQVAAI VL FCSTSS PHS IPLTE VI 
GPIWLMLLLGTVHCQIVSTRTPKPPLSTGGKRRRKLRKAAHLEV 
HREGDGSSTTDNTQEGAVQ1THGTSTSHSVGTVFRDLWHAAFFLS 
GSKKAKKSIDKSTETDNGYVSLDGKKTVKSGEDGIQNHEPQCET 
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ID 
NO: 


beginning 

corresponding 

amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 

corresponding 
to first 

residue of 
amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, XsUnknown, *=Stop 
Codon, /=possible nucleotide deletion, 








IRPEETAWNTGTLRNGPSKDTQRTITNVSDEVSSEEGPETGYSL 
RRHVDRTSEGVLRNRKSHHYKKHYPNEDAPKSGTSCSSRCSSSR 
QDSESARPESETEDVLWEDLLHCAECHSSCTSETDVENHQINPC 
VKKEYRDDPFHQSHLPWLHSSHPGLEKISAIVWEGNDCKKADMS 
VLEISGMIMNRVNSHIPGIGYQIFGNAVSLILGLTPFVFRLSQA 
TDLEQLTAHSASELYVIAFGSNEDVIVLSMVIISFWRVSLVWI 
FFFLLCVAERTYKQVGI M * TSEGVLRNRKSHHYKKHYPNEDAPK 
SGTSCSSRCSSSRQDSESARPESETEDVLWEDLLHCAECHSSCT 
SETDVENHQINPCVKKEYRDDPFHQSHLPWLHSSHPGLEKISAI 
VWEGNDCKKADMSVLEISGMIMNRVNSHIPGIGYQIFGNAVSLI 
LGLTPFVFRLSQATDLEQLTAHSASELYVIAFGSNEDVIVLSMV 
IISFWRVSLVWIFFFLLCVAERTYKQVGIM 


5449 




1833 


KASKVTDAIVWYQKKIGAYDQQIWEKSVEQREIKGLRNKPKKTA 
HVKPDLIDVDLVRGSAFAKAKPESPWTSLTTKGIVRWFFPFFF 
RWWLQVTS KVI FFWLLVL YLLQ VAAI VL FCSTSS PHS IPLTEVI 
GPIWLMLLLGTVHCQIVSTRTPKPPLSTGGKRRRKLRKAAHLEV 
IIREGDGSSTTDNTQEGAVQNH3TSTSHSVGTVFRDLWHAAFFLS 

IRPEETAWNTGTLRNGPSKDTQRTITNVSDEVSSEEGPETGYSL 

RRHVDRTS EGVLRNRKSHHYKKHYPNEDAPKSGTS CSSRCS 53 SR 

QDSESARPESETEDVLWEDLLHCAECHSSCTSETDVENHQINPC ' 

VKKEYEDDPFHQSHLPWLHSSHPGLEKISAIVWEGNDCKKADMS 

VLEISGMIMNRVNSHIPC-IGYQIFGNAVSLILGLTPFVFRLSQA 

TDLEQLTAHSASELYVIAFGSNEDVIVLSMVIISFWRVSLVWI 

FFFLLCVAERTYKQVGIK*TSEGVLRNRKSHHYKKHYPNEDAPK 

SGTSCSSRCSSSRQDSESARPESETEDVLWEDLLHCAECHSSCT 

SETDVENHQINPCVKKEYRDDPFHQSHLPWLHSSHPGLEKISAI 

VWEGNDCKKADMSVLEISGMIMNRVNSHIPGIGYQIFGNAVSLI 

LGLTPFVFRLSQATDLEQLTAHSASELYVIAFGSNEDVIVLSMV 

IISFWRVSLVWIFFFLLCVAERTYKQVGIM 


5450 


8I3S 


1242 


GQQFASFFG*NHPEVTVAMALTDIDLQLQFSMSQPEALLLLAAG 
PADHLLLQLYSGHLQVRLVLGQEELRLQTPAETLLSDSIPHTW 
LTWEGWATLSVDGFLNASSAVPGAPLEVPYGLFVGGTGTLGLP 
YLRGTSRPLRGCLHAATLNGRSLLRPLTPDVHEGCAEEFSASDD 
VALGFSGPHSLAAFPAWGTQDEGTLEFTLTTQSRQAPLAFQAGG 
RRGDFIYVDIFEGHLRAWEKGQGTVLLHNSVPVADGQPHEVSV 
HINAHRLEISVDQYPTHTSNRGVLSYLEPRGSLLLGGLDAEASR 
HLQEHRLGLTPEATNASLLGCMEDLSVNGQRRGLREALLTRNMA 
AGCRLEEEEYEDDAYGHYEAFSTLAPEAWPAMELPEPCVPEPGL 
PPVFANFTQLLTISPLWAEGGTAWLEWRHVQPTLDLMEAELRK 
SQVLFSVTRGAHYGELELDILGAQARKMFTLLD\'VNRKARFIHD 
GSEDTSDQLVLEVSVTARVPMPSCLRRGQTYLLPIQVNPVHDPP 
HIIFPHGSLMVILEHTQKPLGPEVFQAYDPDSACEGLTFCVLGT 
SSGLPVERRDQPGEPATEFSCRELEAGSLVYVHCGGPAQDLTFR 
VSDGLQASPPATLKWAIRPAIQIHRSTGLRLAQGSAMPILPAN 
LSVETNAVGQDVSVLFRVTGALQFGELQKHSTGGVEGAEWWATQ 
AFHQRDVEQGRVRYLSTDPQHHAYDTVENLALEVQVGQEILSNL 
SFPVTIQRATVKMLRLEPLHTQNTQQETLTTAHLEATLEEAGPS 
PPT FH YE WQAFRKGNLQLQGTRLSDGQGFTQDD IQAGRVTYGA 
TARASEAVEDTFRFRVTAPPYFSPLYTFPIHIGGDPDAPVLTNV 
LLWPEGGEGVLSADHLFVKSLNSASYLYEVMERPRLGRLAWRG 
TQDKTTMVTSFTNEDLLRGRLVYQHDDSETTEDDIPFVATRQGE 
SSGDMAWEEVRGVFRVAIQPVNDHAPVQTISRIFHVARGGRRLL 
TTDDVAFSDADSGFADAQLVLTRKDLLFGSIVAVDEPTRPIYRF 
TQEDIiRKRRVLFVHSGADRGWIQLQVSDGQHQATALLEVQASEP 
YLRVANGSSLWPQGGQGTIDTAVLHLDTNLDIRSGDEVHYHVT 
AGPRWGQLVRAGQPATAFSQQDLLDGAVLYSHNC-SLSPEDTMAF 
S VEAGPVHTDATLQVTIALEGPLAPLKLVRHKKI YVFQGEAAE I 
RRDQLEAAQEAVPPADIVFSVKSPPSAGYLVMVSRGALADEPPS 
LD PVQS FSQEAVDTGRVL YLHS RP EAW S DAFS LDVASGLGAP LE 
GVLVELEVLPAAIPLEAQNFSVPEGGSLTLAPPLLRVSGPYFPT 
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Predicted 
beginning 
nucleotide 



Predicted end 
location 



Amino acid segment containing signal pepti 
(A=Alanrine, C=Cysteine, D=Aspartic Acid, I 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stc 
Codon, /=possible nucleotide deletion, 
\=possible 



LLGLSLQVLEPPQHGPLQKEDGPQARTLSAFSWRMVEEQLIRYV 
HDGSETLTDSFVLMANASEMDRQSHPVAFTVTVLPVNDQPPILT 
TNTGLQMWEGATAPIPAEALRSTDGDSGSEDLVYTIEQPSNGRV 
VLRGAPGTEVRSFTQAQLDGGLVLFSHRGTLDGGFPFRLSDGEH 
TSPGHFFRVTAQKQVLLSLKGSQTLTVCPGSVQPLSSQTLRASS 
SAGTDPQLLLYRVWGPQLGRLFHAQQDSTGEALVNFTQAEVYA 
GNILYEHEMPPEPFWEAHDTLELQLSSPPARDVAATLAVAVSFE 
AACPQRPSHLWKNK3LWVPEGQRARITVAALDASNLLASVPSPQ 
RSEHDVLFQVTQFPSRGQLLVSEEPLHAGQPHFLQSQLAAGQLV 
YAHGGGGTQQDGFHFRAHLQGPAGASVAGPQTSEAFAITVRDVN 
ERPPQPQASVPLRLTRGSRAPISRAQLSWDPDSAPGEIEYEVQ 
RAPHNGFLSLVGGGLGPVTRFTQADVDSGRLAFVANGSSVAGIF 
QLSMSDGASPPLPMSLAVDILPSAIEVQLRAPLEVPQALGRSSL 
SQQQLRWSDREEPEAAYRLIQGPQYGHLLVGGRPTSAF3QFQI 
DQGEWFAFTNFSSSHDHFRVLALARGVNASAVrVNVTVRALLHV 
WAGGPWPQGATLRLDPTVLDAGELANRT3SVPRFRLLEGPRHGR 
WRVPRARTEPGGSQLVEQFTQQDLEDGRLGLEVGRPEGRAPGP 
AGDSLTLELWAQGVPPAVASLDFATEPYNAARPYSVALLSVPEA 
ARTEAGKPESSTPTGEPGPMASSPEPAVAKGGFLSFTjEAMMFSV 
IIPMCLVXiLLALILPLLFYLRKRNK'TGKHDVQVLTAKPRKGUA 
GDTETFRKVEPGQAIPLTAVPGQGPPPGGQPDPELLQFCRTPNP 
ALKNGQYWV 



RDSSEQGRTGDTLGRPSACMDALKPPCLWRNHERGKKDRDSCGR 
KNGEPCCPIISLEALRDAAPSQGLNFLLLFTKMLFIFNFLFSPIjP 

tpaliciltfgaaiflwlitrpqpvlplldlnnqsvgieggark 
gvsqknndltsccfsdaktmyevfqrglavsdngpclgyrkpnq 
pyrwlsykovsdra3ylgscllhkgyksspdqfvgifaqhrpf.w 
iiselacytysmvavplydtlgpeaivhivwkadiamvicdtpq 
kalvlignvekgftpslkviilmdpfdddlkqrgeksgieilsl 
ydaenlgkehfrkpvppspedlsvicft3gttgdpkgamithqn 
ivsnaaaflkcvehayeptpddvaisylplahmfer1vqawys 
cgarvgffqgdirlladdmktlkptlfpavprllnriydkvqne 
aktplkkfllklavs5kfkelqkgiirhdsfwdklifakiqdsl 
ggrvrvivtgaapmstsvmtffraamgcqvyeaygqtectggct 
ftlpgdwtsghvgvpiacnyvkledvadmnyftvtojegevcikg 
tnvfkgylkdpektqealdsdgwlhtgdigrwlpngtlkiidrk 
knifklaqgeyiapekieniynrsqpvlqifvhgeslrsslvgv 
wpdtdvlpsfaaklgvkgsfeelcqnqwreailedlqkigke 
sglktfeqvkaiflhpepfs:englltptlkakrgelskyfrtq 
idslyehiqd 



srvpslclslslslspsrepvagapgcgtagppamatlwggllr 
lgsllslsclalsvlllaqlsdaaknfedvrckcicppykensg 
hiynknisqkdcdclhvvepmpvrgpdveayclrceckyeerss 
vrikvtiiiylsllgllllymvyltlvepilkrrlfghaqliqs 
dddigdhqpfanahdvlarsrsranvlnkveyaqqrwklqvqeq 
rksvfdrhwls 



PSIPAAVPQSAPPEPHREETVTATATSQVAQQPPAAAAPGEQAV 
AGPAPSTVPSSTSKDRPVSQPSLVGSKEEPPPARSGSGGGSAKE 
PQEERSQQQDDIEELETKAVGMSWDGRFLKFDIEIGRGSFKTVY 
KGLDTETTVEVAWCELQDRKLTKSERQRFKEEAEMLKGLQHPNI 
VRFYDSWESTVKGKKCIVLVTELMTSGTLKTYLKRFKVMKIKVL 
RSWCRQinXGLQFLHTRTPPIIHRDLKCDWIFrTGPTGSVKIGD 
LGLATLKRASFAKSVIGTPEFMAPEMYEEKYDESVDVYAFGMCM 
LEMATSEYPYSECQNAAQIYRRVTSGVKPASFDKVAIPEVKEII 
EGCIRQNKDERYSIKDLLWHAFFQEETGVRVEIAEEDDGEKIAI 
KLWLRIEDIKKLKGKYKDNEAIEFSFDLERNVPEDVAQEMVESG 
YVCEGDHKTMAKAIKDRVSLIKRKREQRQL* 



PSIPAAVPQSAPPEPHREETVTATATSQVAQQPPAAAAPGEQAV 
AGPAPSTVPSSTSKDRPVSQPSLVGSKEEPPPARSGSGGGSAKE 
PQEERSQQQDDIEEIiETKAVGMSWDGRFLKFDIEIGRGSFKTVY 
KGLDTETTVEVAWCELQDRKLTKSERQRFKEEAEMLKGLQHPNI 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 
location 

amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 

corresponding 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 

P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








VRFYBSKESTVK3KKCIVLVTELMTSGTLKTYLKRFKVMKIKVL 
RSWCRQILKGLQFLHTRTPPIIHRDLKCDNIFITGPTGSVKIGD 

LEMATSE YP YSECQNAAQ I YRRVTSGVKPAS FDKVAIPEVKE I I 
EGCIRQNKDERYSIKDLLNHAFFQEETGVRVELAEEDDGEKIAI 
KLWLRIED I KKLKGKYKDNEAI E FSFDLERNVPEDVAQEMVESG 
YVCEGDHKTMAKAIKDRVSLIKRKREQRQL* 


5455 


1359 


377 


LTMVSPATRKSLPKVKAMDFITSTAI LPLLFGCL3VFGLFRLLQ 
WVRGKAYLRNAVWITGATSGLGKECAKVFYAAGAKLVLCGRNG 
GALEELIRELTA3HATKVQTHKPYLVTFDLTDSGAIVAAAAEIL 
•QCFGYVDILWNAGISYRGTIMDTTVDVDKRVMETNYFGPVALT 
KALLPSMIKRRQGHIVAISSIQGKMSIPFRSAYA^SKHATQAFF 
DCLRAEMEQYE I EVTVI S PGYIHTNLS VNAI TADGSRYGVMDTT 
TAQGRSPVEVAQDVLAAVGKKKKDVILADLLPSLAVYLRTLAPG 
LFFSLMASRARKERKSKNS 


545C 


2 


2 33 2 


CGAGLVAAGAVLVLYPASRAGERTRVPGSPAPSSLPLHSPGACG 
TEVDMDPQRSPLLEVKGNIELKRPLIXAPSQLPLSGSRLKRRPD 
QMEDGLEPEKKRTRGLGATTKITTSHPRVPSLTTVPQTQGQTTA 
QKVSKKTGPRCSTAIATGLKNQKPVPAVPVQKSGTSGVPPMAGG 
KKPSKRPAWDLKGQLCDLNAELKRCRERTQTLDQENQQLQDQLR 

EERLSTQEGLVQELQKKQVELQEERRGLMSQLEEKERRLQTSEA 
ALSSSQAEVASLRQETVAQAALLTEREERLHGLEMERRRLHNQL 
QELKGNIRVFCRVRPVLPGEPTPPPGLIjLFPSGPGGPSDPPTRIj 
SLSRSD3RRGTLSGAPAPPTRHDFSFDRVFPPGSGQDEVFEEIA 
MLVQSALDGYPVCIFAYGQTGSGKTFTMEGGPGGDPQLEGLIPR 
ALRHLFSVAQELSGQGWTYSFVASYVEIYNETVRDT.LATGTRKG 
QGGECEIRRAGPGSEELTVTNARYVPVSCEKEVDALLHLARQNR 
AVARTAQNERSSRSHSVFQLQISGEHSSRGLQCGAPLSLVDLAG 
SERLDPGLALGPGERERLRETQAINSSLSTLGLVIMALSNKESH 
VPYRNSKLTyLLQNSLGGSAXMLMFVNISPLEEOTSESLMSLRF 
ASKVEPSVLFGTAQ3NRKWKTDPDLCVCVCVCVCVCVCVCVCVP 
MSM YRVRGGRVAGG CF IG WRAPCPRA I K 


5457 


2 


1540 


DDFVERRRWTRTTCLVRSPPHVPVCGHACSWNGGSLDPLKGTPA 
LLREAERLMRKVKKLRLDKEKTGSWRSFSLNSEGAERMATTGTP 
TADRGDAAATDDPAARFQVQKHSWDGLRSIIHGSRKYSGL1VNK 
APHDFQFVQKTDESGPHSHRLYYLGMPYGSRENSLLYSEIPKKV 
RKEALLLLSWKQMLDHFQATPHHGVYSREEELLRERKRLGVFGI 
TSYDFHSESGLFLFQASNSLFHCRDGGKNGFMVSPGPGCVSPMK 
PLEIKTQCSGPRMDPKICPADPAFFSFINNSDLWVANIETGEER 
RLTFCHQGLSKVLDDPKSAGVATFV1QEEFDRFTGYWWCPTASW 
EGSEGLKTLRILYEEVDESEVEVIHVPSPALEERKTDSYRYPRT 
GSKNPKIALKLAEFQTDSQGKIVSTQEKELVQPFSSLFPKVEYI 
ARAGWTRDGKYAWAMFLDRPQQWLQLVLLPPALFIPSTENEEQA 
ASLCQS CPQECPAVCGVRGGHQRIiDQCS 


5458 


6642 


4022 


?vpg:re?qwepaqpsatmsapseeeeyarlvmeaqpewlraev 
krlshelaertrekiqaaeyglavleekhqlklqfeelevdyea 
irsekeqlkeafgqahtnhkkvaadgesreesliqesaskeqyy 

VRKVLELQTELKQLRNVLTNTQSENERLASVAQELKEIKQNVEI 
QRGRLRDDIKEYKFREARLLQDYSELEEEN1SLQKQVSVLRQWQ 
VEFEGLKHEIKRLEEETEYLNSQLEDAIRLKEISERQIjEEALET 
LKTEREQKNSLRK.ELSHYMS1NDSFYTSHLHVSLDGLKFSDDAA 
EPNNDAEALVNGFEHGGLAKLPLDNKTSTPKKEGLAPPSPSLVS 
DLLSELNISEIQKLKQQLMQMEREKAGLLATLQDTQKQLEHTRG 
SLSEQQEKVTRLTENLSALRRLQASKERQTALDKEKDRDSHEDG 
DYYEVDINGPEILACKYHVAVAEAGELREQLKALRSTHEAREAQ 
HAEEKGP.YEAEGQALTEKVSLLEKASRQDRELLARLEKELKKVS 
DVAGETQGSLSVAQDELVTFSEEIjANLYHHVCMCNNETPNRVML 
DYYREGQGGAGRTSPGGRTSPEARGRRSPILLPKGLLAPEAGRA 
DGGTGDSSPSPGSSLPSPLSDPRREPKNIYNLIAIIRDQIKHLQ 
AAVDRTTELSRQRIASQELGPAVDKDKEALMEEILKLKSLLSTK 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 
amino acid 

amino acid 
sequence 


| Predicted end 

corresponding 
to first 

residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
{A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y=Tyro3ine, X=Unknown, *=Stop 
Codon, /=posslble nucleotide deletion, 
\=possible nucleotide insertion) 








REQITTLRTVLKANKQTAEVALANLKSKYENEKAMVTETMMKLR 
NELKALKEDAATFS SLRAMPATRCDEYI TQLDEMQRQLAAAEDE 
KKTLNSLLRMAIQQKLALTQRLELLELDHEQTRRGRAKAAPKTK 
PATPSVSHTCAGASDRAEGTGLANTQVFCSEKHSIYCD 


5459 




1262 


RGGHRbSGMASNFKDIVKQGYVRIRSRRLGIYQRCWLVFKKASS 
KGPKRLEKFSDERAAYFRCYHKVTELNNVKNVARLPKSTKECHAI 
GIYFNDDTSKTFACESDLEADEWCKVLQMECVGTRINDISLGEP 
DLuATGVEREQS ERFNVYLMPS PNLGC YMGECALQITYEYI CLW 
DVQNPRVKLISWPLSALRRYGRDTTWFTFEAGRMCETGEGLFIF 
QTRDGEAIYQKVHSAALAIAEQHERLLQSVKNSMLQMKMSERAA 
SLSTMVPLPRSAYWQHITRQHSTGQLYRLQDVSSPLKLHRTETF 
PAYRSEH 


5460 


45 


2097 


RPGCRA3ELSTG3RARERVRNRVSAPCGQDSRRCDPEVLRGRSP 
GLGLAEMP SCGACTCGAAAVRL ITSS LASAQRG I SGGRI HMSVL 
GRLGTFETQILQRAPLRSFTETPAYFASKDG1SKDGSGDGNKKS 
ASEGSSKXSGSGNSGKGGN0LP.CPKC3DLCTHVETFVSST1FVK 
CEKCHHFFWLSSADSKKSIIKEPESAAEAVKLAFQQKPPPPPK 
KIYNYLDKYWGQSFAKKVLSVAVYNHYKRIYNNIPANLRQQAE 
VEKQTSLTPRELEIRRREDEYRFTKLLQIAGISPHGNALGASMQ 
QQVHQQIPQEKRGGEVLDSSHDDIKLEKSNILLLGPTGSGKTLL 
AQTLAKCLDVPFAICDCTTLTQAGYVGEDIESVIAKLLQDANYN 
VEKAQQGIVFLDEVDKIGSVPGIHQLRDVGGEGVQQGLLKLLEG 
TIVKVPEKNSRKLRGETVQVDTTNIIjFVASGAFNGLDRI ISRRK 
NEKYLGFGTPSNLGKGRRAAAAADLANRSGESNTHQDIEEKDRL 
LRHVEARDLIEFGMIPBFVGRLPVWPLHSLDEKTT.VQILTEPR 
NAVIPQ/'^T 1 r ! f?' T^DALKAIARLALERKTGARGLiRS 
IMEKLLLEPMFEVPNSDIVCVEVDKEWEGKKEPGYIRAPTKES 
SEEEYDSGVEEEGWPRQADAANS 


54S1 






inpppppkspcgrarkwrrrrrpgapeaavmelpsgpgperlfd 
shrlpgdcflllvlllyapvgfcllvlrlflgihvflvscalpd 
svlrrfwrtmcavlglvarqedsglrdhsvrvlisnhvtpfdh 
nivnllttcstpllnsppsfvcwsrgfmemngrgelveslkrfc 
astrlpptplllfpeeeatngregllrfsswpfsiqdwqpltl 
qvqrplvsvtvsdaswvsellwslfvpftvyqvrwlrpvhrqlg 
eaneefalrvqqlvakelgqtgtrltpadkaehmkrqrhprlrp 
qsaqs s fpps pqps pdvqlatlaqrvke vlphvplgviqrdlak 

TGCVDLTITNT.LEGAVAFMPFDITKGTQSLPTASASKFPSSGPV 
TPQPTALTFAKSSWARQESLQERKQALYEYARRRFTERRAQEAD 


5462 


SS3 


3353 


XIKERQMSANNSPPSAQKSVLPTAIPAVLPAASPCSSPKTGLSA 
RLSNGSFSAPSLTNSRGSVHTVSFLLQIGLTRESVTIEAQELSL 
SAVKDLVCSIVYQKFPECGFFGMYDKILLFRHDMNSENILQLIT 
SADEIHEGDLVEWLSALATVEDFQIRPHTLYVHSYKAPTFCDY 
CGEMLWGLVRQGLKCEGCGLWYHKRCAFKrPNWCSGTOKRRLSW 
VSLPGPGL5VPRPLQPEYVALPSEESHVHQEPSKRIPSWSGRPI 
NME f M VMCR VKVPHTFAVHS YTRPT I CQYCKRLLKGLFRQGMQC 
KDCKFNCHKRCASKVPRDCLGEVTFNGEPSSLGTDTDIPMDIDN 
NOINSDSSRGLDDTEEPSPPEDKMFFLDPSDLDVERDEEAVKTI 
SPSTSHNIPLMRWQSIKHTKRKSSTMVKEGWMVHYTSRDNLRK 
RHYWRLDSKCLTLFQNESGSKYYKEIPLSEILRISSPRDFTNIS 
QGSNPHCFEIITDTMVYFVGENNGDSSHNPVLAATGVGLDVAQS 
WSKAIRQALMPVTPQASVCTSPGQGKDHKDLSTSISVSNCQIQE 
NVDISTVYQ IFADE VLGSGQFG I VYGGKHRKTGRD VAIKVIDKM 
RFPTKQESQLRNEVAILQNLHHPGIWLECMFETPERVFWMEK 
LHGDMLEMILSSEKSRLPERITKFMVTQILVALRNLHFKNIVHC 
DLK?EN\'LLASAEPFPQVKLCDFGFARIIGEKSFRRSWGTPAY 
LAPEVLRSKGYNRSliDMWSVGVIIYVSLSGTFPFNEDEDINDQI 
QNAAFMYPPWPWREISGEAIDLIflNLLQViCMRKRYSVDKSLSHP 
WLQDYQTWLDLREFETRIGERYITHESDDARWElHAYTHNIjVYP 
ICHFIMAPWPDDMEEDP 


5463 


237 


1012 


LLSVTMTTSRCSHLPEVLPDCTSSAAPWKTVEDCGSLVNGQPQ 
YVMQVS AKDGQLLSTWRTLATQS PFNDRPMCRI CHEGS SQEDL 
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SEQ 
ID 
NO: 


Predicted 

nucleotide 

corresponding 
to first 
amino acid 
residue of 

sequence 


Predicted end 
nucleotide 
location 
corresponding 

amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylal&nine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, K=Methionine, N=Aeparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W= Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LSPCECTGTLGTIHRSCLEHWLSSSNTSYCELCHFRFAVERKPR 
PLVEWLRNPGPQHEKRTLFGDMVCFLFITPLATISGWLCLRGAV 
DHLHFSSRLEAVGLIALTVALFTIYLFWTLVSFRYHCRLYNEWR 
RTNQRVILLIPKSVNVPSNQPSLLGLHSVKRNSKETW 


54S4 


195 


£77 


SPSMNPRKKVDLKLI IVGAIGVGKTSLLHQYVHKTFYEEYQTTL 
GAS I LSKI I ILGDTTLKLQ I WDTGGQERVRSMVSTF YKGSDGC I 
LAFDVTDLESFEALDIWRGDVLAKIVPMEQSYPMVLLGNKIDLA 
BRKYQSILENHLTESIKLSPDQSRSRCC 


5465 




3348 


KGDPREFI RVHREALECDYVSAHLHEW I DL I FG YKQQGPAAVEA 
VKVFHHLFYEGQVDIYNINDPLKETATIGFINNFGQIPKQLFKK 
PHPPKRVRSRLNGDNAGISVLPGSTSDKIFFHHliDNLRPSLTPV 
KELKEPVGQIVCTDKGILAVEQNKVLIPPTWNKTFAWGYADLSC 
RLGTYESDKAMTVYECLSEKGQILCAICPNPKLVITGGTSTVVC 
VWEMGTSKEKAKTVTLKQALLGHTDTVTCATASLAYHIIVSGSR 
DRTC I IWDLNKLSFLTQLRGHRAP VSALCIUELTGD I VS CAGTY 
IHVWSINGNPIVSVNTFTGRSQQIICCCMSEMNEWDTQNVIVTG 
HSDGWRFWRMEFLQVPETPAPEPAEVLEMQEDCPEAQIGQEAQ 
DEDSSDSEADEQSISQDPKETPSQPSSTSHRPRAASCRATAAWC 
TDSGSDDSRRWSDQLSLDEKDGFIFVNYSEGQTRAHLQGPLSHP 
HPNPIEVRNYSRLKPGYRWERQLVFRSKLTMHTAFDRKDNAHPA 
EVTAIjGISKDHSRILVGDSRGRVFSWSVSDQPGRSAADHWVKDE 
GGDSCSGCSVRFSLTERRHHCRNCGQLFCQKCSRFQSEIKRLKI 
SSPVRVCQNCYYNLQHERGSEDGPRMC 


5466 


3 




KACAHASAHASGRLVRWWRKRRSVMGIC'i'SPVLL.ASLGVGLVTL 
L 3LAVGSYLVRRSRRPQVTLLDPNEKYLLRLLDKTTVSHNTKRF 
RFALPTAHHTLGLPVGKHIYLSTRIDGSLVIRPYTPVTSDEDQG 
YVDLVIKVYLKGVHPKFPEGGKMSQYLDSLKVGDWEFRGPSGL 
LTYTGKGHFNIQPNKKSPPEPRVAKKLGMIAGGTGITPMLQLIR 
AILKVPEDPTQCFLLFANQTEKDIILREDLEELQARYPNRFKLW 
FTLDHPPKDWAYSKGFVTADMIREHLPAPGDDVLVLLCGPPPMV 
OLACHPNLDKLGYSQKMRFTY 


5467 


2103 




3EALRVGTRGCRRDLPDPQARIFIQKKDLEEDESVTAAHLKSRG 
RSPRKIDQFCNSSNMVHGSVTFRDVAIDFSQEEWECLQPDQRTL 
YRDVMLENYSHLISLAGSSISKPDVITIiLEQEKEPWMWRKETS 
RRYPDLELKYGPEKVSPENDTSEVNLPKQVIKQISTTLGIEAFY 
FRBDSEYRQFEGLQGYQEGNINQKMISYEKLPTHTPHASLICNT 
HKPYECXECGKYFSCRSNI,IQHQSIHTGEKPYKCKECGKAFQLH 
IQLTRHQKFHTGEKTFECKECGKAFNLPTQLNRHKNIHTVKKLF 
ECKECGXS FNRS SNLTQHQS I HAGVKP YQCKE CGKAFNRGSNL I 
;i , ^VEKPFVCKECGMAFRYHYQLIEHCQIHTGEKPFECKE 
2GKAFTLLTKLVRHQKIHTGEKPFECRECGKAFSLLNQLNRHKN 
IHTGEKPFECKECGKSFNRSSNLVQHQSIHAGIKPYECKECGKG 
FNRGAHLIQHQKIHSNEKPFVCRECEMAFRYHCQLIEHSRIHTG 
DKPFECQDCGKAFNRGSSLVQHQS1HTGEKPYECKECGKAFRLY 
LQLSQHQKTHTGEKPFECKECGKFFRRGSNLNQHRSIHTGKKPF 
ECKECGKAFRLHMHLIRHQKLHTGEKPFECKECGKAFRLHMQLI 
RHQKLHTGEKPFECKECGKVFSLPTQLNRHKNIHTGEKAS 


5468 


225 


2976 


SFLTDLFQSLAQLENLCKQLYETTDTTTRLQAEKALVEFTNSPD 

CLSKCQLLLERGSSSYSQLLAATCLTKLVSRTNNPLPLEQRIDI ' 

RNYVLNYLATRPKLATFVTQALIQLYARITKLGWFDCQKDDYVF 

RtTAITDVTRFLQDSVEYCI IGVTILSQLTNEINQVSATAFLIEA 

DTTHPLTKHRKIASSFRDSSLFDIFTLSCNLLKQASGKNLNLND 

ESQHGLLMQLLKLTHNCLNFDFIGTSTDESSDDLCTVQIPTSWR 

SAFLDSSTLQLSTIGRCEYEKTCALLVQLFDQSAQSYQELLQSA 

SASPMDIAVQEGRLTWLVYIIGAVIGGRVSFASTDEQDAMBGEL 

VCRVLQLMNLTDSRLAQAGNEKLELAMLSFFEQFRKIYIGDQVQ 

KSSKLYRRLSEVLGLNDETMVLSVFIGKIITNL.KYWGRCEPITS 

KTLQLLNDLSIGYSSVRKLVKLSAVQFMLNNHTSEHFSFLGINN 

QSNLTDHRCRTTFYTALGRLLMVDLGEDEDQYEQFMLPLTAAFE 

AVAQMFSTNSFNBQEAKRTLVGLVRDLRGIAFAFNAKTS FMMLF 

EWIYPSYKPILQRAIELWYHDPACTTPVLKLMAELVHNRSQRLQ 
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SEQ 
ID 
NO: 


beginning 
nucleotide 

cor re sponding 
to first 
amino acid 
residue of 
amino acid 


Predicted end 
nucleotide 
location 
corresponding 

residue of 


Amino acid segment containing signal peptide 
(AeAlanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Hietidine, I=Isoleucine, K=Lysine, 

P=Proline, Q=Glutaraine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nQcleotide insertion) 








fdvsspng;llpretskmitmygnriltlgev?kdqvyalklkg 
isicfsmlkaalsgsyvnfgvfrlygddaldnalqtfiklllsi 
phsdlldypklsqsyysllevltqdhmnfiaslephvimyilss 
isegltaldtmvctgccscldhivtylfkqlsrstkkrtrplnq 
esdrflhimqqhpemiqqmlstvlniiifedcrnqwsmsrpllg 
lillnekyfsdlrnsiwsqppekqqamhlcfenlmegiernll 
tknrdrftqnlsafrrevndsmknstygwsndmms 


5469 


134 


2G53 


DQEFETSLVPWHLPMGWLCSGLLFPVSCLVLLQVASSGNMXVLQ 
EPTCVSDYMSISTCEWKMNGPTNCSTELRLLYQLVFLLSEAHTC 
VPENNG3AGCVCHLLMDDWSADHYTLDLWAGQQLLWKGSFKPS 
EHVKPRAPGNLTVHTNVSDTLLLTWSxMPYPPDNYLYNHLTYAVN 
IWSENDPADFRIYNVTYLEPSLRIAASTLKSGISYRARVRAWAQ 
CYNTTWSEWSPSTKWHNSYREPFEQHLLLGVSVSCIVIIAVCLL 
CYVSITKIKKEWWDQIPMPARSRLVAIIIQDAQGSQWEKRSRGQ 

SAWCPVEISKTV1WPESISVVRCVELFEAPVECEEEEEVEEEKG 
SFCASPESSRDDFQEGREGIVARLTESLFLDLLGEENGGFCQQD 
MGESCLLFPSGSTSAHMPWDEFPSAGPKEAPPWGKEQPLHLEPS 
PEASPTQSPDNLTCTETPliVlAGNPAYRSFSNSLSQSPCPRELG 
PDPLLARHLEEVEPEMPCVPQLSEPTTVPQPEPETWEQILRRNV 
LQHGAAAAPVSAPTSGYQEFVHAVEQGGTQASAVVGIjGPPGEAG 
YKAFSSLLASSAVSPEKCGFGASSGEEGYKPFQDjIPGCPGDPA 
PVPVPLFTFGLDREPPRSPQSSHLPSSSPEHLGLEPGEKVEDMP 
KPPLPQEQATDPLVDSLGSGIVYSALTCHIiCGHrjKQCHGQEDGG 
QTPVMASPCCGCCCSDRASPPTTPI.RAPDPSPGGVPLEASLCPA 


5470 


17 


1413 


TACRIRTSLNRGIAAVKEDAVEMLASYGIjAYSLMKFFTGPMSDF 
KNVGLVFVWSKRDRTKAVLCMWAGAIAAVFHTLIAYSDLGYYI 
INKLHHVDESVGSKTRRAFLYLAAFPFMDAMAWTHAGILLKHKY 
SFLVGCASISDVIAQWFVAILLHSHLEGREPLLIPILSLYMGA 
LVRCTTLCLGYYKNIHDIIPDRSGPELGGDATIRKMLSFWWPLA 
LILATQRISRPIVNLFVSRDLGGSSAATEAVAILTATYPVGHMP 
YGWLTEIRAVYPAFDKNNPSKKLVSTSNTVTAAHIKKFTFVCMA 
LSLTLCFVMFWTPNVSEKI LIDI IGVDFAFAELCWPLRI FS FF 
PVPVTVRAHLTGWLMTLKKTFVLAPSSVLRIIVLIASLWLPYL 
GVHGATLGVGSLLAGFVGESTMDAIAACYVYRKQKKKMENESAT 
EGEDSAMTDMPPTEEVTDIVEMREENE 


5471 


1868 


658 


RSSAPPGP3RAAAATAAAAAAGVEMAAAAAQGGGGGEPRRTEGV 
GPGVPGEVEMVKGQPFDVGPRYTQLQYIGEGAYGMVSSAYDHVR 
KTRVAIKKISPFEHQTYCQRTLREIQILLRFRHENVIGIRDILR 
ASTLEANRDVYIVQDLMETDLYKLLKSQQLSNEHICYFIiYQILR 
GLKYIHSANVLHRDLKPSNLLINTTCDLKICDEGLARIADPEHD 
HTGFLTEYVATRWYRAPEIMLNSKGYTKSIDIWSVGCILAEMLS 
KRPIFPGKHYLDQLNHILGILGSPSQEDLNCIINMKARNYLQSL 
PSKTKVAWAKLFPKSDSKALDLLDRMLTFNPNKRITVEEALAHP 
YLEQYYDPTDEPVAEEPFTFAMELDDLPKERLKELIFQETARFQ 
PGVLEAP 


5472 


1469 


753 


UYVM iEYLSDEEVAVSIDRLCKANGRSPSIPFGTVRIPGRARVR 
DPQALWIFGYGSLVWRPDFAYSDSRVGFVRGYSRRFWQGDTFHR 
GSDKMPGRWTLLEDHEGCTWGVAYQVQGEQVSKALKYLNVREA 
VLGGYDTKEVTFYPQDAPDQPLKALAYVATPQNPGYLGPAPEEA 
lATQILACRGFSGHNLEYLLRVRDVMQLCGPQAQDEHuAAIVDA 
VGTMLPCFCPTEQALALV 


54 73 


3 


2119 


Ft'iXYKLLIQDLEDIEQRVPVMDAQYKIITKTAHLITKESPQEEG 
KEMFATNSKliKEQLTKVKECYS PLL YESQQLL I PLEELEKQMTS 
FYDSLGKINEIITVLEREAQSSALFKQKHQELLACQENCKKTLT 
LIEKGSQSVQKFVTLSNVLKHFDQTRLQRQIADIHVAFQSMVKK 
TGDWKKKVETNSRLMKKFEESRAELEKVLRIAQEGLEEKGDPEE 
LLRRHTE PFSQLDQRVLNAFLKACDELTD I LPEQEQQGLQEAVR 
KLHKQWKDLQGEAPYHLLHLKIDVEKNRFLASAEECRTELDRET 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 
amino acid 

amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 

residue of 


Amine acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E=> 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glut amine , R=Arginine, 
S=Serine, T= Threonine, V=Valine, 
W^Trypfcophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








KLM2QE3SEKIIKEHRVFFSDKGPHHLCEKRLQLIEELCVKLPV 
RDPVRDTPGTCHVTLXELRAAIDSTYRKLMEDPDKWKDYTSRFS 
E F3 SWI STNETQLKG I KGEAIDTANHGEVKRAVEEIRNGVTKRG 
ETLSWLKSRLKVLTEVSSENEAQKQGDEIAKLSSSFKALVTLLS 
E VEKMLSNFGDCVQYKE I VKNSLEEL I SGS KEVQEQAEK1 LDTE 
NLFEAQQLLLHHQQKTKRISAKKRDVQQQIAQAQQGEGGLPDRG 
HE3LRKLESTLDGLERSRERQERRIQVTLRKWERFETNKETWR 
YLFQTGSSHERFLSFSSLESLSSELEQTKEFSKRTESIAVQAEK 
LVKEASEIPLGPQNKQLLQQQAKSIKEQVKKLEDTLEEEYVIDK 
S 


5471 


2 


780 


TPDVRQLQASRRGIAVASWCSPRWFAGEEMAFVKSGWLLRQSTI 
LKRWKKNWFDLWSDGHHYYDDQTRQIIIEDKVHMPMDCI1IIRTG 
QECRDrQPPDGKSKDCMLQIVCRDGKTISLCAESTDDCLAWKFT 
LQDSRTNTAYVGSAVMTDETSWSSPPPYTAYAAPAPEVGRTLS 
LQQAYGYGPYGGAYPPGTQWYAANGQAYAVPYQYPYAGLYGQQ 
PANQVIIRERYRDNDSDLALGMLAGAATGMALGSLFWVF 


5475 


2 


50S 


argwleslsltcqttpppsspcllhspetfihtmppnltgyyrf 

3tfrnytvqfdvgvefeedlrsvdgrkcqtivtweeehlvcvqk 
gevpnrgwrhwlegemlyleltarhavceqvfrkvr 


547S 


192 


1457 


3DSMSLLDCFCTSRTQVESLRPF.KQSETSIHQYLVDEPTLSWSR 
PSTRASEVLCSTNVSHYELQVEIGRGFDNLTSVHLARHTPTGTL 
VTIKITNLENCNEERLKALQKAVILSHFFRHPNITTYWTVFTVG 
SWLWVTSPFMAYGSASQLLRTYFPEGMS3TLIRNILFGAVRGLN 
YLHQNGCIHRSIKASHILISGDGLVTLSGLSHLHSLVKHGQRHR 
AVYDFPQFSTSVQPWLSPELLRQDLHGYNVKSDIYSVGITACEL 
ASGQVPFQDMHRTQMLLQKLKGPPYSPLDISIFPQSESRMKNSQ 
SGVDSGIGESVLVSSGTHTVHSDRLHTPSSKTFSPAFFSLVQLC 
LQQDPEKRPSASSLLSHVFFKQMKEESQDSILSLLPPAYNKPSI 
SLPPVLPWTEPECDFPDEKDSYWEF 


5477 


3 


1044 


RGNSRLRYSHEDELQLPRLPELFETGRQLLDEVEVATEPAGSRI 
VQEKVFKGLDLLEKAAEMLSQLDLFSRNEDLEE IAS TDLKYLL V 

LPKTMHNSAEWHTAiaSSMAYPSLVAMASQRQAKIQRYKQKKELE 
HRLSAMKSAVESGQADDERVREYYLLHLQRWIDISLEEIESIDQ 
EIKILRERDSSREASTSNSSRQERPPVKPFILTRNMAQAKVFGA 
GYPSLPTMTVSDWYEQHRKYGALPDQGIAKAAPEEFRKAAQQQE 
EQEEKEEEDDEQTLHRAREWDDWKDTHPRGYGNRQNMG 


5478 


2 


835 


KTVRIWVPNVKGESTVFRAHTATVRSVHFCSDGQSFVTASDDKT 
VKVWATHRQKFLFSLSQHINWVRCAKFSPDGRLIVSASDDKTVK 
1 JDI °PFCVHSYCEHGGFVTYVDFHPSGTCIAAAGMDNTVKVW 
DVRTHRLLQHYQLHSAAVNGLS FHPSGNYL I TASSDSTLKILDL 
MEGRLLYTLHGHQGPATTVAFSRTGEYFASGGSDEQVMVWKSNF 
DIGDHGEVTKVPRPPATLASSMGWLTVSILEQRLTLEEDKLKQC 
LENQQLIMQRATP 


5479 


2 


835 


I "' nVPNVKGESTVFRAHTATVRSVHFCSDGQSFVTASDDKT 
VKVWATHRQKFLFSLSQHINWVRCAKFSPDGRLIVSASDDKTVK 
LWDKSSRECVHSYCEHGGFVTYVDFHPSGTCIAAAGMDNTVKVW 
DVRTHRLLQHYQLHSAAVNGLSFHPSGNYLITASSDSTLKILDL 
MEGRLLYTLHGHQGPATTVAFSRTGEYFASGGSDEQVMVWKSNF 
DIGDHGEVTKVPRPPATliASSMGNLTVSILEQRLTLEEDKLKQC 
LENQQLIMQRATP 


5480 




1952 


lT: i EA3 /Kl^ZjQAITDKRKIQEEISQKRLKIEEDKLKH 
QHLKKKALREKWLLDGISSGKEQEEMKKQNQQDQHQIQVLEQSI 
LP.LEKEIQDLEKAELQISTKEEAILKKLKSIERTTEDIIRSVKV 
EREERAEESIEDIYANIPDLPKSYIPSRLRKEINEEKEDDEQNR 
KALYAMEIKVEKDLKTGESTVLSSIPLPSDDFKGTGIKVYDDGQ 
KSVYAVSSNHSAAYNGTDGLAPVEVEELLRQASERNSKSPTEYH 
EPVYANPFYRPTTPQRETVTPGPNFQERIKIKTNGLGIGVNESI 
Hr-IKGlN-GLSEERGWtlFNHISPlPPVPHPRSVIQQAEEKLHTPQKR 



330 



WO 01/53312 



PCT/US00/34263 



SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 

amino acid 

amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to Eirst 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E- 
Glutatnic Acid, F=Phenylalanine , G=Glycir.e, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T= Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LMTPfoEESNVMQDKDAPSPKPRLSPRETIFGKSEHQNSSETCQE 
DEEDVRYNIVHSLPPDINDTEPVTMIPMGYQQAEDSEEDKKFLT 
GYDGIIHAELWIDDEEEEDEGEAEKPSYHPIAPHSGVYQPAKP 
TPLPRKRSEASPHEKHKS 


5481 


3 


1422 


NSPGSVCLCQCVCPSLLHCLPPLLLLLLLPLLLHESPQPPALRV 
VATSSDRtJFMNKHQKPVLTGQRFKTRKRDEKEKFEPTVFRDTLV 
QGLNEAGDDLEAVAKFLDSTGSRLDYRRYADTLFDILVAGSMLA 
PGGTRIDDGDKTKMTNHCVFSANEDHETIRITYAQVFNKLIRRYK 
YLEKAFEDEMKKLLLFLKAFSETEQTKLAMLSGILLGNGTLPAT 
ILTSLFTDSLVKEGIAASFAVKLFKAWMAEKDANSVTSSLRKAN 
LDKRLLELFPVNRQSVDHFAKYFTDAGLKELSDFLRVQQSLGTR 
KELQKELQERLSQECPIKEVVLYVKEEMKRNDLPETAVIGLLWT 
CIMNAVEWNKKEELVAEQALKHLKQYAPLLAVFSSQGQSELILL 
QKVQEYCYDNIHFMKAFQKI WLFYKADVLS ESAI LKWYKEAHV 
AKGKSVFLDQMKKFVEWLQNAEEESESEGEEN 


5482 


1492 


528 


THWMTGMCYAPHQVLSYINGVTTSKPGVSLVYSMPSRNLSLRL 
EGLQEKDSGPYSCS VNV^ K^I..}- i! '11, n PAPPS 
CRLQGVPHVGANVTLSCQSPRSKPAVQYQWDRQLPSFQTFFAPA 
LDVIRGSLS hTNLSSS MAG VYVCKAHNE VGTAQCNVTLE VSTGP 
GAAWAGAWGTLVGLGLLAGLVLLYHRRGKALEEPANDIKEDA 
IAPRTLPWPKSSDTISKNGTLSSVTSARALRPPHGPPRPGALTP 
TPSLSSQALPSPRLPTTDGAHPQPISPIPGGVSSSGLSRMGAVP 
VMVPAQSQAGSLV 


S483 


1 


788 


FrFJKGCRAGRGNESDYRKIiEEMHQRFLVSERSKDDLQLRIiTRA 
ENRIKQLETDSSEEISRYQEMIQKLQNVLESERENCGLVSEQRL 
KLQQENKQLRKETESLRKIALEAQKKAKVKISTMEHEFSIKERG 
FEVQLREMEDSNRNSIVELRHLLATQQKAANRWKEETKKLTESA 
EIRINNLKSELSRQKLHTQELLSQLEI4ANEKVAENEKLILEHQE 
KANRLQRRLSQAEERAASASQQLSVITVQRRKAA3LMNLENI 


5484 


3 


1997 


IMADMEDLFGSDADSEAERKDSDSGSDSDSDQENAASGSNASGS 
ESDQDERGDSGQPSNKELFGDDSEDEGASHHSGSDNHSERSDNR 
SEASERSDHEDNDPSDVDQHSGSEAPNDDEDEGHRSDGGSHHSE 

LQNSDDDEKMQNTDDEERPQLSDDERQQLSEEEKANSDDERPVA 
SDNDDEKQNSDDEEQPQLSDEEKMQNSDDERPQASDEEHRHSDD 
EEEQDHKSESARGSDSEDEVLRMKRKIIAIASDSBADSDTEVPKD 
N3GTMDLFGGADDISSGSDGSDKPPTPCQPVDENGLPQDQQEEE 
P I PETRI EVE I PKVNTDLGNDLYFVKLPNFLS VE PRPFDPQYYE 
D3FEDE3MLDEEGRTRLKLKVENTIRWRIRRDEEGNEIKESNAR 
IVKWSDGSMSLHLGNEVFDVYKAPLQGDHNHLFIRQGTGLQGQA 
VFKTKLTFRPHSTDSATHRKMTLSLADRCSKTQKIRILPMAGRD 
PECQRTEMIKKEEERLRASIRRESQQRRMREKQHQRGLSASYLE 
PDRYDEEEEGEESISLAAIKNRYKGGIREERARIYSSDSDEGSE 
SCKAQRLIiKAKKLTSDEVRPNLFNSRGLSCTQEPTALNEELTDQ 
AGTN 


5485 


161 


1074 


iV^SSNMDSEAHEKRPPILTSSKQDISPHITNVGEMKHYLCG 
CCAAFNNVAI T FP I QKVL FRQQL YG I KTRDAI LQLRRDG FRNL Y 
RGILPPLMQKTTTLALMFGLYEDLSCLLHKHVSAPEFATSGVAA 
VLAGTTEAIFTPLERVQTLLQDHKHHDKFTNTYQAFKALKCHGI 
GEYYRGLVPI LFRNGLSNVLFFGLRGP I KEHLPTATTHSAHLVN 
DFICGGLLGAMLGFLFFPINWKTRIQSQIGGEFQSFPKVFQKI 
WLERDRKLINLFRGAHLNYHRSLISWGIINATYEFLLKVI 




1404 


142 


IPGST2. SWSPAAARGLSVCRCCRLHPASAMDLFGDLPEPERS PR " 
PAAC-KEAQKGPLLFDDLPPASSTDSGSGGPLLFDDLPPASSGDS 
GSLiATSISQMVKTEGKGAKRKTSEEEKNGSEELVEKKVCKASSV 
IF3LKGYVAERKGEREEMQDAHVILNDITEECRPPSSLITRVSY 
FAVFDGHGGIRASKFAAQNLHQNLIRKFPKGDVISVEKTVKRCL 
LDTFKHTDEEFLKQASSQKPAWKDGSTATCVLAVDNILYIANLG 
DSRAILCRYNEESQKHAALSLSKEHNPTQYEERMRIQKAGGNVR 
DGRVLGVLEVSRSIGDGQYKRCGVTSVPDIRRCQLTPNDRFILL 
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Predicted end 



nucleotide 
> first 



Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 



AVSLEQIRGLQTPAPVPLPLQPCPSNCDMERVTLALLLLAGLTA 
LEANDPFANXDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGK 
CKCKSSQKQHSPVPEKAIPLITPGS ATTC 
AMAASGEPQRQWQEEVAAVWVGSCMTDLVSLTSRLPKTG3TIH 
GHKFFIGFGGKGANQCVQAARLGAMTS1WCKVGKDSFGNDYIEN 
LKQNDISTEFTYQTKDAATGTASI1VNMEGQNIIVIVAGANLLL 
NTEDLRAAANVISRAKVMVCQLEITPAXSLEALTMARRSGVKTL 
FNPAPAIADLDPQFYTLSDVFCCNESEAEILTGLTVGSAADAGE 
AALVLLKRGCQWIITLGAEGCWLSQTEPEPKHIPTEKVKAVD 
TTVSFKI 

GKGPVAAFIDQSNIFLTDPKIFLGQWREEPKMPLLLLGETEPLK 
LERDCR3PVEPWAAASPDLALACLCHCQDLSSGAFPNRGVLGGV 
LFPTVEMVIKVFVATSSGSIAIRKKQQEWGFLEANKIDFKELD 
IAGDEDNRRWMRENVPGEKKPQNG1PLPPQIFNEEQYCGDFDSF 
FSAKEENIIYSFLGLAPPPDSKGSEKAEEGGETEAQKEGSEDVG 
NLPEAQEKNEEEGETATEETEEIAMEGAEGEAEEEEETAEGEEP 
GEDEDS 

GKGPVAAFIDQSNIFLTDPKIFLGQWREF.PKMPLLLLGETEPLK 
LEKDCiiSPVEPWAA.o.SPDLAIACLCHCQDLSSGAFPNRGVI.GGV 
IiFPTVEMVIKVFVATSSGS IAIRKKQQEWGFLEANKIEFICELD 
lAGDEDNRRKMREOTPGEKKPQNGlPLPPQIFNEEQYCGDFDSF 
FSAKEENIIYSFLGLAPPPDCKGGEKAEEGGETEAQKEGSEDVG 
NLPEAQEKNEEEGETATEETEEIAMEGAEGEAEEEEETAEGKEP 
GEDEDS 

gsaprlslgptgaqardpdwwarppsrpytqskedrpdtegr.se 
qgdmassflpagaitgdsggelssgddsgevefphspeieetsc 
laelfekaaahlqgliqvasreqllylyarykqvkvgncntpkp 
sffdfegkqkweawkalgdsspsqamqeyiawkkldpgwnpqi 
pekkgkeantgfggpvisslyheetireedknifdycrennidh 
itkaiksknvdvnvkdeegrallhwacdrghkelvtvllqhrad 
incqdnegqtalhyasacefldivelllqsgadptlrdqdgclp 
eevtgcktvslvlqrhttgka 

asknplsavcttgimsslavrdpamdrslrsvfvgnipyeatee 
qlkdifsevgswsfrlvydretgkpkgygfceyqdqetalsam 
rnlngrefsgralrvdnaaseknkeelkslgpaap i idspygdp 
idpedapesitravaslppeqmfelmkqmklcvqnshqearnml 
lqnpqlayallqaqvvmrimdpeialkilhrkihvtplipgksq 
svsvsgpgpgpgpglcpgpnvllnqqnppapqpqklarrpvkdi 
pplmqtpiqggipapgpipaavpgagpgsltpggamqpqlgmpg 
vgpvplergqvqmsdprapiprgpvtpgglppr3llgdapndpr 
ggtllsvtgeveprgylgpphqgppmhhasghdtrgpsshemrg 
gplgdprlligeprgpmidqrglpmdgrggrdsrametramete 



MTGGIQG PGP INIGAGGPPQGPRQVPGI SGVGNPGAGMQGTGI Q 
GTGMQGAG I QGGGMQGAG I QGVS I QGGG I QGGG I QGAS KQGGS Q 
PSSFSPGQSQVTPQDQEKAALIMQVLQLTADQIAMLPPEQRQSI 
LILKEQIQKSTGAS 



RAPMMTKAVPEEPRF.PGRLTQALNSPLTWEHVWICVPGGTPDCL 
TDTFRVKRPHLRRSASWGHVPGTPVYREKEDMYDEIIELKKSLH 
VQKSDVDLMRTKLRRLEEENSRKDRQIEQLLDPSRGTDFVRTLA 
EKRPDASWVINGLKQRILKLEQQCKEKDGTISKLQTDMKTTNLE 
EKRIAMETYYEEVHRLQTLLASSETTGKKPLGEKKTGAKRQKKM 
GSALLSLSRSVQELTEEWQSLKEDLDRVLSTSPTISKTQGYVEW 
SKPRLLRKI VELEKKLSVMESSKSHAAEPVRSHPPACIASSSAL 
HRQPRGDRNKDHERLRGAVRDLKEERTALQEQLLQRDLEVKQLL 
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Predicted end 
nucleotid< 
location 

irresponding 
> first 

icid 



Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutaraine, R=Arginine, 
iidue of S=Serine, T=Threonine, V=Valine, 

mo acid W=Tryptophan, Y- Tyrosine, X=Unknown, *=Stop 

juence Codon, /^possible nucleotide deletion, 

\=possible nucleotide ins ert i on ) 

GHLTRTKLLASKAHGS EPPSVPGLPDQS S PVPRVPS P IAQATGS 
PVQEEAIVIIQSALRAHLARARHSAT3KRTTTAASTRRRSASAT 
HGDASSPPFIAALPDPSPSGPQAVAPLPGDDVNSDDSDDIVIAP 

SLPTKNFPV 

53S RSKAXI3TFTREVPSTDMKVRRESSSSLTHRPAPSPATPRLLGT" 
RRVLLGVSEGTGCADAMELVLVFLCSLLAPMVLASAAEKEKEMD 
PFHYDYQTLRIGGLVFAVVLFSVGILLILSRRCKCSFNQKPRAP 

GDE E AQVENL I TAMATE P QKAEN 

2168 DSLLLIQVDTMPFTLHLRSRLPSAIR3LILQKKPNIRNTSSMAG 
ELRPAS LWLPRS LAPAFERFCQVNTGPLPLLGQSE PEKWMLPP 
QGAISETRMGHPOFWKYEFGACTGSLASLEQYSEOLKDMVAFFL 
GCSFSLEEALEKAGLPRRDPAGHSQAGAYKTTVPCVTHAGFCCP 
LWTMRPIPKDKLEGLVRACCSLGGEQGQPVHMGDPELLGIKEL 
SKPAYGDAMVCPPGEVPVFWPSPLTSLGAVSSCETPLAFASIPG 
CTVMTDLKDAKAPP3CLTPERIPEVHHISQDPLHYSIASVSASQ 
ORELESMIGIDPGNRGIGHLLCKDELLKASLSL3HARSVLITT 
GFPTHFNHEPPEETDGPPGAVALVAFLQALEKEVAIIVDQRAWN 
LHQKIVEDAVEQGVLKTQ I P I LTYQGGSVEAAQAFLCKNGDPQT 
PRFDHLVAIERAGRAADGNYYNARKMNIiatLVDPIDDLFLAAKK 
IPGISSTGVGDGGNELGMGKVKEAVRRHIRHGDVIACDVEADFA 
VIAGVSNWGGYALACALYILYSCAVHSQYLRKAVGPSRAPGDQA 
WTQALPSVIKEEKMLGILVQHKVRSGVSGIVGMEVDGLPFHNTH 

AEMIQKLVDVTTAQV 

240 B aDTKMHEIYKGNZTPQLNKNTLK'l'CAATDVWAVYFSQFWIDYEG - 
MKSGKGRPISFVDSFPLSIWICQPTRYAESQKEPQTCNQVSLNT 
SQSESSDLAGRI.KRKKLLKEYYSTESEPLTNGGQKPSSSDTFFR 
FSPSSSEADIHLLVHVHKHVSMQINHYQYLLLLFLHESLILLSE 
NLRKDVEAVTGSPASQTSICIGILLRSAEIALLLHPVDQANTLK 
SPVSESVSPWPDYLPTEMGDFLSSKRKQISRDIWRIRSVTVNH 
MSDNRSMSVDLSHIPLKDPLLFKSASDTNLQKGISFMDYLSDKH 
LGKI S EDE SSGLVYKSGSGE I GSETSDKKDSF YTDS S S VLNYRE 
DSNILSFDSDGNQNILSSTLTSKGKETISSIFKAEDLLPEAASL 
SENLDISKEETPPVRTLKSQSSLSGKPKERCPPNLAPLCVSYKN 
KKRSSSQM3LDTISLDSMILEEQLLESDGSDSHMFLEKGNKKHS 
TTNYRGTAESVKAGANLQNYGETS PDAI S TKSEGAQENHDDLMS 
WVFKITGVNGE 1 D IRGEDTE I CLQVNQVTPDQLGNI SLRHYLC 
NRPVGSDQKAVIHSKSSPEISLRFESGPGAVIHSLLAEKNGFLQ 
CHIKKFSTSFLTSSLMNIQHFLEDETVATVMPKKIQVSNTKIUL 
KDDSPRSSTVSLEPAPVTVHIDHLWERSDDGSFH1RDSHMLNT 
GNDLKENVKSDSVLLTSGKYDLKKQRSVTQATQTSPGVPWPSQS 
AETFPEFSFDFTREQLMEEMESLKQELAKAKMALAEAHLEKDALL 

HHIKKMTVE 

3308 SISKLLKRRSNIDAYLLSNSCAFFAPRLFSbASQIIREQQSPHV'" 
CFIYKYSGFPSLECQCHFVSPHSSCYINFFSFPPPFFVCFQLSN 
GFSHYSLSSESHVGPTGAGLFPHCLPASRLLPRVTSVHLPEYAH 
YYTIGPGMFPSSQIPSWKDWAKPGPYDQPLVNTLQRRKEKREPD 
PNGGGPTTASGPPAAAEEAQRPRSMTVSAATRPGEEMEACEELA 
LALSRGLQLDTQRSSRDSLQCSSGYSTQTTTPCCSEDTIPSQVS 
DYDYFSVSGDQEADQQEFDKSSTIPRNSDISQSYRRMFQAKRPA 
STAGLFTTLGPAMVTPGVATIRRTPSTKPSTORGTIGAGPIPIK 
TPVIPVKTPTVPDLPGVLPAPPDGPEERGEHSPESPSVGEGPQG 
VTSMPSSMWSGQASVNPPLPGPKPSIPEEHRQAIPESEAEDQER 
EPPSATVSPGQIPESDPADLSPRDTPQGEDMLNAIRRGVKLKKT 

TTHDRSAPRFS 

1492 ILTHQ3IFTG3XPCECGKASIQMSHLSQQKIYSGENPFACKVCG 
KVFSHKSNLTEHEHFHTREKPFECNECGKAFSQKQYVIKHQN'iH 
TGEKLFECNECGKSFSQKENLLTHQKIHTGEKPFECKDCGKAFI 
QKSNLIRHQRTHT3EKPFVCKECGKTFSGKSNLTEHEKIHIGEK 
PFKCSECGTAFGQKKYLIKHQNIHTGEKPYECNECGFAFSQRTS 
LIVHVRIHSGDKPYECNVCGKAFSQSSSLTVHVRSHTGEKPYGC 
yECGKAFEQFSTLALHLRIHTGKKPYQCSECGKAFSQKSHHIRH 
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SEQ 
ID 
NO: 


beginning 
nucleotide 
location 
corresponding 

amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 

residue of 


Ammo acid segment containing signal peptide 
[A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline ( Q=Glutamine, R=Arginine, 
S=Serine, T= Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








QKIHTH 


5499 


324 


926 


GFGQI GRGHKI TTYPFS PRK3GRKGMAQSQGWVKRYIKAFCKGF 
FVAVPVAVTFLDRVACVARVEGASMQPSLNPGGSQSSDWLLNH 
WXVRNFEVHRGDIVSLVSPKNPEQKIIKRVIALEGDIVRTIGHK 
NRYVKVPRGHIOTEGDHHGHSFDSNSFGPVSLGLLHAHATHTLW 
PPERWQKLESVLPPERLPVQREEE 


5500 


1978 


128S 


KPDWRLQNLPPRLYLWRSSRFGFGHLKKRLQMDFKIEHTWDGFP 
VKHEPVF IRLNPGDRGVMMD I SAP FFRDPPAPLGEPGKP FNELW 
DYEWEAFFLNDITEQYLEV3LCPHGQHLVLLLSGRRNVWKQEL 
PLSFRVSRGETKWEGKAYLPWSYFPPNVTKFNSFAIHGSKDKRS 
YEALYPVPQHELQQGQKPDFHCLEYFKSFNFWTLLGEEWKQPES 
DLWLIEKCDI 


5501 


2S27 


2226 


CRPPV3A- 7 ) " i p 7_ i 7 u 2SSRPFSLP 
AAIMLALISRLLDWFRSLFWKEEMELTLVGLQYSGKTTFVNVIA 
SGQFSEDMIPTVGFNMRKVTKGNVTIKIWDIGGQPRFRSMWERY 
CRGVNAIVYM1DAADREK1 EASRNELHNLLDKPQLQGIPVLVLG 
NKRDLPIJALDEKQLIEKMNLSAIODREICCYSISCKEKDNIDIT 
LQWLIOHSKSRRS 


5502 


3 


824 


NSAFPVWVPERTALLTCPLGAAPGSSREAPG1AGPPNSTAMSKL 
GKFFKGGG3SKSRAAPSPQEALVRLRETEEMLGKKQEYLENRIQ 
REIALAKKHGTQNKRAALQALKRKKRFE KQLTQ IDGTLSTI E FQ 
REALENSHTNTEVLRNMGFAAKAMKSVHh'NMDLNKIDDLMQEIT 
EQQDIAQEISEAFSQRVGFGDDFDEDELMAELEELEQEELNKKM 
TNIRLPHVPSSSLPAQPNRKPGMSSTARRSRAASSQRAEEEDDD 
JK2LAAWAT 


5503 


216 . 


654 


J ( D DSHLGYFKMSFLLPKLTSKKEVDQAIKSTA 
EKVLVLRFGRDEDPVCLQLDDILSKTSSDLSKMAAIYLVDVDQT 
AVYTQYFDI SYI PSTVFFFNGQHMKVDYGGEDPALRS IKAVRRT 
SPAGTLGEKPVNS 


5504 


58 


3563 


QLSFSFQAPVTFDDITVYLLQEEWVLLSQQQKELCGSNKLVAPL- 
GPTVANPEIjFRKFGRGPEPWLGSVQGQRSLIjEHHPGKKQMGYMG 
EMEVQGPTRESGQSLPPQKKAYLSHLSTGSGHIEGDWAGRNRKL 
LKPRSIQKSWFVQFPWLIMNEEQTALFCSACREYPSIRDKRSRL 
I EGYTG P FKVETLKYHAKSKAHMFCVNALAARDPI WAARFRS IR 
DPPGDVLASPEPLFTADCPIFYPPGPLGGFDSMAELLPSSRAEL 
EDPGGDGAIPAMYLDCISDIiRQKEITDGIHSSSDINrLyNEAVB 
SCIQDPEAEGLSEEVPWFEELPWFEDVAVYFTREEWGMLDKR 
QKELYRDVMRMNYELLASLGPAAAKPDLISKLERRAAPWIKDPN 
RPPGNKKMVAVREADTQASAADSALLPGSPVEARASC 
CSSSI CEEGDGPRRIKRTYRPRS IQRSWFGQFPWLVIDPKETKL 
FCSACIERPNLHDKSSRLVRGYTGPFKVETLKYHEVSKAHRLCV 
NTVEIKEDTPHTALVPEISSDLMANMEHFFNAAYSIAYHSRPLN 
DFEKILQLLQSTGTVILGKYRNRTACTQFIKYISETLKREILED 
VRNSPCVSVLLDSSTDASEQACVGIYIRYFKQMEVKESYITLAP 
LYSETAEGYFETIVSALDELDIPFRKFGWWGLGTDGSAMLSCR 
GGLVE KFQE V I PQLLP VHC VAHRLHLAWDACGS I DLVKKCDRH 
IRTVFKFYQSSNKRLNELQEGAAPLEQEIIRLKDLNAVRWVASR 
RRTLHALLVSWPALARHLQRVAEAGGQIGHRAK3KLKLMRGFHF 
VKFCHFLLDFLS1YRPLSEVCQKEIVLITEWATLGRAYVALES 
LI k, .3PKEEEFNASFKDGRLHGICLDKLEVAEQRFQADRERTV 
LTGIEYLQQRFDADRPPQLKNMEVFDTMAWPSGIELASFGNDDI 
LNLARYFECSLPTGYSEEALLEEWLGLKTIAQHLPFSMLCKNAL 
AQHCRFPLLSKLMAVWCVPISTSCCERGFKAMNRIRTDERTKL 
SNEVLNMLMMTAVNGVAVTEYDPQPAIQHWYLTSSGRRFSHVYT 
CAQ VPARS PASARLR KEEMGAL Y VEEPRTQKP P I LP S REAAEVL 
XDCIMEPPERLLYPHTSQEAPGMS 


5505 


3312 


1219 


NCSPRSLSAAKMSNRNNNKLPSNLPQLQNLIKRDPPAYIEEFLQ"" 
CYNH ''XSN\'3IFKLQPNKPSKELAELVMFMAQISHCYPEYLSNF 
PQEVKDLLSCNHTVLDPDLRMTFCKALILLRKKNLINPSSLLEL 
FFELFRCHDKLLRKTLYTHIVTDIKNINAKHKKNKVNWLQNFiyi 
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SEQ 
ID 
NO: 


beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 

sequence 


corresponding 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E=. 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 








YTMLRDSNATAAKKSLDVMIELYRRNIWNDAKTVNVITTACFSK 
VTKILVAALTFFLGKDEDEKQDSDSESEDDGPTARDIiLVQYATG 
KKS3KNKKKLEKAKKVLKKHRKKKKPEVFNFSAIHLIHDPQDFA 
EKLLKQLBCCKERFEVKMMLMNLISRLVGIHELFLFNFYPFLQR 
FLQPHQREVTKILLFAAQASHHI.VPPEIIQSLLMTVANNFVTDK 
KSGEVMTVGINAIKEITARCPLAMTEELLQDLAQYKTHKDKNVM 
MSARTLIHLFRTLNPQMLQKKFRGKPTEASIEARVQEYGELDAK 
DYIPGAEVLEVEKEENAENDEDGWESTSLSEEEDADGEWIDVQH 
SSDEEQQEISKKLNSMPMEERKAKAAAISTSRVLTQEDFQKIRM 
AQMRKELDAAPGKSQKRKYIEIDSDEEPRGELLSLRDIERDHKK 
PKSDKETRLATAMAGKTDRKEFVRKKTKTNPFSSSTNKEKKKQK 


5506 


l 


1531 


FRGDLCGQRGGSAPGEGGSSAWPAPAHPLPEREREREALCPGRS 
CSGGGGEETPGTTPVWSPLE3GGDEELRPNPYVRFPYRWWAVW 
LAAFPSLGAGGETPEAPPESWTQLWFFRFVTOAA3YASFMVPGY 
LLVQYFRRKNYLETGRGLCFPLVKACVFGNEPXASDEVPLAPRT 
EAAETTPMWQALKLLFCATGLQVSYLTWGVLQERVMTRSYGATA 
TSPGERFTDSQFliVLMNRVLALIVAGLSCVIiCKQPRHGAPMYRY 
SFASLSMVLSSWCQYEALKFVSFPTQVLAKASKVIPVMLMGKLV 
SRRSYEHWEYLTATLISIGVSMFLLSSGPEPRSSPATTLSGLIL 
LAGYIAFDSFTSNWQDALFAYKMSSVQMMFGVNFFSCLFTVGSL 
LEQGALLEGTRFMGRHSEFAAHALLLSICSACGQLFIFYTIGQF 
GAAVFTIIMTLRQAFAILLSCLLYGHTVTWGGLGVAWFAALL 
LRVYARGRLKQRGKKAVPVE3PVQKV 


5507 


3704 


1271 


PRGTRRCRPAGRASRRARRRPPCPGPAAPGSLEIGHFGTAAGKK 
VAVADVQFGPMRi'HQDQLQVLLVFTKEDMQCNGFCRAC:KKAGFK 
CTVTKEAQAVLACFLDKHHD HIT DHRNPRQLDA3ALCRS IRSS 
KLSENTVIVGWRRVDREELSVMPFISAGFTRRYVENPNIMACY 
NELLQLEFGEVRSQLKLRACNSVFTALENSEDAIHITSEDRFIQ 
YANPAFETTMGYQSGELIGKSLGEVPINEKKADLLDTINSCIRI 

NGNNKAEKI S E CVQSDTHTDNQTGKHKDRRKGS LDVKAVASRAT 
SVSSQRRHSSMARIHSMTIEAPITKVINIINAAQESSPMPVTEA 
IjDRVLEILRTTELYSPQFGAKDDDPHANDLVGGLMSDGLRRLSG 
neyvlstkntqmvssniitpislddvppriarameneeywdfdi 
FELE AATHNRPL 1 YLGLKMFARFG ICEFLHCSESTLRSWLQIIE 
ANYHSSNPYHKSTHSADVLHATAYFLSKERIKETLDPIDEVAAL 
IAATIHDVDHPGRTNSFLCNAGSELAILYNDTAVLESHHAALAF 
QLTTGDDKCNIFKNMERNDYRTLRQGIIDMVLATEMTKHFEHVN 
XFVNSINKPLATLEENGETDKNQEVINTMLRTPENRTLIKRMLI 
KCADVSNPCRPLQYCIEWAARISEEYFSQTDEEKQQGLPWMPV 
FDRNTCS I PKSQ I S FIDYFI TDMFDAWDAFVDLPDLMQHLDNNF 
KYWKGLDEMKLRNLRPPPE 


5508 


1151 


691 


LSSVFSRRSASMFAVGCSMGPFLHYWYLSLDRLFPASGLRGFPN 
VLKKVLVDQLVASPLLGVWYFLGLGCLEGQTVGESCQELREKFW 
EFYKADWCVWPAAQFVNFLFVPPQFRVTYINGLTLGWDTYLSYIj 
KYRSPVPLTPPGCVALDTRAD 


5509 


123 8 


619 


RKSRGCQNALSASGPAAAAAAIMVRKLKFHEQKLLKQVDFLNWE 
VTDHNLHELRVLRRYRLQRREDYTRYNQLSRAVRELARRLRDLP 
EEDQFRVRASAALLDKLYALGLVPTRGSLELCDFVTASSFCRRR 
LPTVLLKLRMAQHLQAAVAFVEQGHVRVGPDVVTDPAFLVTRSM 
EDFVTWVDSSKIKRHVLEYNEERDDFDLEA 


5510 




1195 


PAGAHLSSGSSEPLVEPGRGRVGARVKGERGLQASGSAPGRSKM 
AEGERQPPPDSSEEAPPATQNFIIPKKEIHTVPDMGKWKRSQAY 
ADYIGFILTLNEGVKGKKLTFEYRVSEAIEKLVALLKTLDRWID 
ETPPVDQPSRFGNKAYRTWYAKLDEEAENLVATWPTHLAAAVP 
EVAVYLKESVGNSTRIDYGTGHEAAFAAFLCCLCKIGVLRVDDQ 
IAIVFKVFNRYLEVMRKLQKTYRMEPAGSQGVWGLDDFQFLPFI 
WGS3QLIDHPYLEPRHFVDEKAVNENHKDYMFLECILFITEMKT 
GPFAEHSNQLWNISAVPSWSKVNQGLIRMYKAECBEKFPVIQHF 
KFGSLLPIHPVTSG 
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SEQ 
ID 
NO: 


Predicted 

nucleotide 

corresponding 
to first 
amino acid 
residue of 

sequence 


Predicted end 

corresponding 
to first 

sequence 


Amxno acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 


5511 


276 


1980 


i"Ji r I:;VT,W^PPENLIT3ISAVPISQKEEVADFQLSVDSLLEKDND 
HSRPBIQVQAKRLAEXLRCDTWSEISTGQRTVNFKINRELLTK 
TVLQQVIEDGSKYGLKSELFSGLPQKKIWEFSSPNVAKKFHVG 
HLRSTIIGNFIANLKEALGHQVIRINYLGDWGMQFGLLGTGFQL 
FGYEEKLQSNPLQHLFEVWQVNKEAADDKSVAKAAQEFFQRLE 
LGDVQALSLWQKFRD^SIEEYIRVYKRLGVYFDEYSGESFYREK 
SQBVLKLLESKGLLLKTlKGTAVVDhSGbTGDPSSICTVMRSDGT 
RI.YATRDLAAAIDRMDKYNFDTMIYVTDKGQKKHFQQVFQMLKI 
MGYDWAERCQHVPFGWQGMKTRRGDVTFLEDVLNEIQLRMLQN 
MASIKTTKELKNPQETAERVGLAALIIQDFKGLLLSDYKFSWDR 
VFQSRGDTGVFLQYTHARLHSLEETFGCGYLNDFNTACLQEPQS 
VSILQHLLRFDEVLYKSSQDFQPRHIVSYLLTLSHLAAVAHKTL 
QIKDSPPEVAGARLHLFKAVRSVLANGMKLLGITPVCRM 


5512 


120 




DPSLLL'i'l'rVTGVTVLVLVLKSMMSRRREPITLQDPEAKYPLPL 
IEKEKISHNTRRFRFGLPSPDHVLGLPVGNYVQLLAKIDNELW 
RAYTPVSSDDDRGFVDLIIKIYFKNVHPQYPEGGKMTQYliENMK 
IGETIFFRGPRGRLFYHGPGKLGIRPDQTSEPKKTLADHLGMIA 
GGTGITPMLQLIRHITKDPSDRTRMSLIFANQTEEDILVRKELE 
EIARTHPDQFDLWYTLDRPPIGWKYSSGFVTADMIREHLPPPAK 
STLILVCGPPPLIQTAAHPNLEKLGYTQDMIFTY 


5513 


2 


837 


AKWKL,L>KDSPKIPPAGAETPGRGSCRNYLPSSSPPPPEPSSFPS 

PPTSRGGPGSRDTMSDSEEESQDRQLKIVVIjGDGAaGKTSLTTC 

FAQETFGKQYKQTIGLDFFLRRITLPGNLNVTLQIWDIGGQTIG 

GKMLDKYIYGAQGVLLVYDITNYQSFENLEDWYTWKKVSEESE 

TQPLVALVGNKIDLEHMRTIKPEKHLRFCQENGFSSHFVSAKTG 

DSVFL^ ] LCIXi^lKAEIEOSQR j t 

RTVNPPRSSMCAVQ 


5514 


1295 


449 


\ , 1 1 ' 1- £ W TKGNFRGHALPGTFFFI IGLWWCTKS ILKYICKKQKRT 
CYLGSKTLFYRLEILEGITIVGMALTGMAGEQFIPGGPHLMLYD 
YKQGHWNQLLGWHHFTMYFFFGLLGVADILCFTISSLPVSLTKL 
MLSNALFVEAFIFYNHTHGREMLDIFVHQLLVLWFLTGLVAFL 
EFLVRITOVLLELLRSSLILLQ3SWFFQ.IGFVLYPPSGGPAWDLM 
DHENILFLTICFCWHYAVTIVIVGMNYAFITWLVKSRLKRLCSS 
EVGLLKNAEREQESEEEM 


5515 


1572 


260 


FVRLVGRGDCDPLLSVCLTTMPLYEGLGSGGEKTAWIDLGEAF 
TKCGFAGETGPRCIIPSVIKRAGKPKPVRWQYNINTEELYSYL 
KEFIHILYFRHLLVNPRDRRWIIESVLCPSHFRETLTRVLFKY 
F3VPSVLLAPSHLMALLTLGINSAM\'LDCGYRESLVrjPIYEGIP 
VLNCWGALPLGGKALHKELETQLLEQCTVDTSVAKEQSLPSVMG 
SVFEGVLEDIKARTCFVSDLKRGLKIQAAKFNIDGWWERPSPPP 
NVDYPLDGEKILHILGSIRDSWEILFEQDNEEQSVATLILDSL 
I QCP IDTRKQLAENLWI GGTSMLPGFLHRLLAE IRYLVE KPKY 
KKALG7KTFR1HTPPAKANCVAWLGGAIFGALQDILGSRSVSKE 
YYNQTGRIPDWCSLNNPPLEMMFDVGKTQPPLMKRAFSTEK 


5515 


3 


735 


NSREPPQA3PC-PSPRKSPTASSFLFPWRPLASSFWMGAQGAQES 
IKAMWRVPGTTRRPVT3ESPGMHRPEAMLLLLTLALLGGPTWAG 
1 _f 333\YFSTTEDYDHEITOLRV£VGLLLVKSVQVKLGDSW 
t 1 1 ^LGGNTQEVTLQPGEYITKVFVAFQAFLRGMVMYTSKDR 
YFYFGKLDGQISSAYPSQEGQVL.VG1YGQYQLLGIKSIGFEWNY 
PLEEPTTEP PVNLTYS ANS P VGR ' 


5517 


245 


499 


SEIYVAMRTDSSKMTDVESGVANFASSARAGRRNALPDIQSSAA 
TDGTSDLPLKLEALSVKEDAKEKDEKTTQDQLEKPQWEEK 


5518 


3 


1375 


DAWAoA' R r, I J.HMDF?CLWI,3LLLPLVAAIiDFNYHRQEGMEA 
' 1 ^QlTiTSSVTHLHSIGKSVKGRNLWVLWGRFPKEHRIGIP 
El V ^IMHGDETVGRELLLHLIDYLVTSDGKDPEITNLINSTR 
ZHIMPSKNPDGFEAVKXPDCYYSIGRENYNQYDLNRNFPDAFEY 
~ CPETVAVMKWLXrETFVLSAWLHGGAIiVASYPFDNGVQA 
TGALYSRSLTPDDDVFQYLAHTYASRNPNMKKGDECKNKMMFPN 
GVTNGYSWYPLQGGMQDYNYIWAQCFEITLELSCCKYPREEKLP 
SFSNKNKASLIEYIKQVHLGVKGQVFDQNGNPLPNVIVEVQDRK 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 

amino acid 


Predicted enc 
nucleotide 
location 
corresponding 
to first 
amino acid 

sequence 


I Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
p=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyroeine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








HICPYRTNKYGEYYLLLLPGSYIINVTVPGHDPHITKVIIPEKS 
QNFSALKKDI LLPFQGQLDS I PVSNPS CPMI PLYENLPDHSAAT 
KPSLFLFLVSLLHIFFK 


5519 


87 


477 


I KS XLNQQVEVQESEWRLTEAKGPTMGKESGWDSGRAAVAAWG 
GWAVGTVLVALSAMGFTSVGIAASSIAAKMMSTAAIANGGGVA 
AGSLVAILQSVGAAGLSVTSKVIGGFAGTALGAWLGSPPSS 


5520 


117 


943 


PTEGRQKVLKTFTVPRSALAMTKTSTCIYHFLVLSWYTFLNYYI 
SQEGKDEVKPKILANGARWKYMTLLNLLLQTIFY3VTCLDDVLK 
RTKGGKDIKFLTAFRDLLFTTIAFPVSTFVFIAFWILFLYNRDL 
IYPKVLDTVIPVWLNHAMHTFIFPITLAEWLRPHSYPSKKTGL 
TLLAAASIAYISRILWLYFETGTWVYPVFAKLSLLGLAAFFSLS 
YVFtASIYLLGEKLNHWKWVSVQILQRWRLESVGICFQWPDWKS 
PAKHQLVKNIR 


5521 




911 


KILNMQKSCEENEGKPQNMPKAEEDRPLEDVPQEAEGNPQPSEE 
GVSQEAEGNPRGGPNQPGQGFKEDTPVRHLDPEEMIRGVDELER 
LREEIRRVRNKFVMMHWKQRHSRSRPYPVCFRP 


5522 


1224 


63 7 


GSRPLGQRSREKMWVFGYGSLIWKVDFPYQDKLVGYITNYSRRF 
WQGSTDHRGVPGKPGRVVTLVEDPAGCVWGVAYRLPVGKEEEVK 
AYLDFRBKGGYRTTTVIFYPKDPTTKPFSVLLYIGTCDNPDYLG 
PAPLEDIAEQIFNAAGPSGRNTEYLFELANSIRNLVPEEADEHL 
FALEKLVKERLEGKQNLNCI 


5523 


3 


1280 


- F I - ATARRPVFDDKEDVNFDHFQILRAIGKGSFG' 
KVCIVQKRDTEKMYAMKYMNKQQCIERDEVRNVFRELEILQEIE 
HVFLVNLWYSFQDEEDMFMWDLLLGGDLRYHLQQNVQFSEDTV 
RLY I CEMALALD YLRGQH 1 1 HRDVKPDNILLDERGHAHLTDFNI 
ATI IKDGERATALSGTKPYMAPEI FHSFVMGGTGYSFEVDWWSV 
GVMAYELLRGWRPYDIHSSNAVESLVQLFSTVSVQYVPTWSKEM 
VALLRKLLTVNPBHRLSSLQDVQAAPALAGVLWDHLSEKRVEPG 
FVPWKGRLHCDPTFELEEMILESRPLHKKKKRLAKNK5RDNSRD 
SSQSENDYLQDCLDAIQQDFVTFNREKLKRSQDLPREPLPAPES 
RDAAEPVEDEAERSALPMCGPICPSAGS3 


5524 


85 


2313 


RERERDHRPGESSQGQSGAGGCFPSPTMELRCGGLLFSSRFDSG 
NLAHVEKVES LS SDGEGVGGGASALTSGI ASS PDYEFNVWTRPD 
CAETEFSKGKRSWFYFSVRGGMPGKLIKINIMNMNKQSKLYSQG 
MAPFVRTLPTRPRWERIRDRPTFEMTETQFVLSFVHRFVEGRGA 
TTFFAFCYPFSYSDCQELLNQLDQRFPENHPTHSSPLDTIYYHR 
ELLCYSLDGLRVDLLTITSCHGLREDREPRLEQLFPDTSTPRPF 
RFAGKRIFFLSSRVHPGETPSSFVFNGFLDFILRPDDPRAQTLR 
RLFVFKLIPMLNPDGWRGHYRTDSRGVNLNRQYLKPDAVLHPA 
IYGAKAVLLYHHVHSRLNSQSSSEHQPSSCLPPDAPVSDLEKAN 
-VLQNEAQCGHSADRHNAEAWKQTEPAEQIOjNSVWIMPQQSAGLE 
ESAPDTIPPKESGVAYYVDLHGHASKRGCFMYGNSFSDESTQVE 
NMLYPKLISLNSAHFDFQGCNFSEKNMYARDRRDGQSKEGSGRV 
A I YKASG I IHS YTLE CNYNTGRS VNS I PAACHDNGRAS PP P P PA 
FPSRYTVELFEQVGRAMAIAALDMAECNPWPRIVLSEHSSLTNL 
SAWMLKHVRNSRGLSSTLNVGVNKKRGLRTPPKSHHGLPVSCSE 
NTLSRARSF3TGTSAGGSSSSQQNSPQMKNSPSFPFHGSRPAGL 
PGLGS S TQKVTHRVLGPVRGKPVWEPLQHVFGCLGHCWGK 




105 


834 


SNTLDFERHLFIMGQQISDQTQLVINKLPEKVAKHVTLVRESGS 
LTYEEFLGRVAELNDVTAKVASGQEKHLLFEVQPGSDSSAFWKV 
WRWCTKINKSSGIVEASRIiyiNLYQFIQLYKDITSQAAGVLAQ 
SSTSEEPDENSSSVTSCQASLWMGRVKQLTDEEECCICMDGRAD 
LILPCAHSFCQKCIDKWSDRHRNCPICRLQMTGANESWWSDAP 
TEDDMANYILNMADEAGQPHRP 


5525 


3 


853 


RRPCNPVRAAKRTGAAARAPRGLEVTMLRVAWRTLS L I RTRAVT 
QVLVPGLPGGGSAKFPFNQWGLQPRSLLLQAARGYWRKPAQSR 
LDDDPPPSTLLKDYQNVPGIEKVDDWKRLLSLEMANKKEMLKI 
KQEQFMKKIVANPEDTRSLEARIIALSVKIRSYEEHLEKHRKDK 
AHKRYLLMS IDQRKKMLKNLRNTNYDVFEKI CWGLG IE YTFPPL 
YYRRAHRRFVTKKALCIRVFQETQKLKKRRRALKAAAAAQKQAK 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

residue of 
amino acid 
sequence 


Predicted end 

location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrcsine, X=tTnknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








RRNPDSPAKAIPKTLKDSQ 


5527 


3225 


565 


LLRKYLLHQNPLLLRHQPNRTCISFSATMKLKDTKSRPKQSSCG 
KFQTKGIKWGKWKEVKIDPNMPADGQMDDLVCFEELTDYQLVS 
PAKKPSSLFSKEAPKRKAQAVSEEEEEEEGKSSSPKKKIKLKKS 
KNVATEG TSTQKE FEVKD PELEAQGDDMVCDD PEAGEMTS ENLV 
QTAPKKKKNKGKKGLEPSQSTAAKVPKKAKTWIPEVHDQKADVS 
AWKDLFVPRPVLRALSFLGFSAPTPIQALTLAPAIRDKLDILGA 
AETGSGKTLAFAIFMIHAVLQWQKRNAAPPPSNTEAPPGETHTE 
AGAETRS PGKAEAESDALPDDTVIESEALPSDIAAEARAKTGGT 
VSDQALLFGDDDAGEGPSSLIREKPVPKQNENEEENLDKEQTGN 
LKQELDDKSATCKAYPKRPLLGLVLTPTRELAVQVKQHIDAVAR 
FTGIKTAI LVGGiMS TQKQORMLNRRPE I WATPGRLWELI KE KH 
YHLRNLRQLRCLWDEADRMVEKGHFAELSQLLEMLNDSQYNPK 
RQTLVFSATLTLVHQAPARILHKKHTKKMDKXAKLDLLMQKIGM 
RGKPKVIDLTRNEATVETLTETKIHCETDEKDFYLYYFLMQYPG 
RSLVFANSISCIKRLSGLLKVLDIMPLTLHACMHQKQRLRNLEQ 
FARLEDCVLLATDVAARGLDIPKVQHVIHYQVPRTSEIYVHRSG 
RTARATNEGLSLML IGPEDVINFKKI YKTLKKDEDI PLFPVQTK 
YMD WKER I RLARQ I E KS E YRN FQACLHNS W I EQAAAALE I ELE 
EDMYKGGKADQQEERRRQKQMKVLKKELRHLLSQPLFTESQKTK 
YPTQSGKPPLLVSAPSKSESALSCIiSKCKKKKTKKPKEPQPEQP 
QPSTSAN 


5528 


3 


895 


GPFLSACRMWGACKVKVHDSLAT1SITLRRYLRI,GATMAKSKFE 
YVRDFEADDTCLAHCWVWRLDGRWFHRFAEKHWFAKPNDSRAL 
QLMTKCAQTVME3LED1VIAYGQSDEYSFVFKRKTNWFKRRASK 
FMTHVASQFASSYVFYWRDYFEDQPLLYPPGFDGRVWYPSNQT 
LKDYLSWRQADCHINNLYNTVFWALIQQSGLTPVQAQGRLQGTL 
AADKNEILFSEFNINYNNEPPMYRKGTVLIWQKVDEVMTKEIKL 
PTEMEGKKMAVTRTRTKPCKPSHLPRAPCLRWL 


5529 


48 


64C 


TFRLVSAHLKTRKLINPEAAERRWRDNDSRQGWLSVKMQRVSGL 
LSWTLS.^VLWLSGLSEPGAARQPRIMEEKALEVYDLIRTIRDPE 
KPNTLEELEWSESCVEVQEINEEEYLVIIRFTPTVPHCSLATL 
IGLCLRVKLQRCLPFKHKLEIYISEGTHSTEEDINKQINDKERV 
AAAMENPNLREIVEQCVLEPD 


5530 


4541 


2fiOS 


AQIVHAISYCHKLHVGHRDLKPENWFFEKQGLVKLTDFGFSNK 
FQPGKKIjTTS CGSLAYSAPE I LLGDEYDAPAVD I WSLGVILFML 
VCGQPPFQEANDSETLTMIMDCKYTVPSHVSKECKDLITRMLQR 
DPKRRASLEEIENHPWLQGVDPSPATKYNIPLVSYKNLSEEEHN 
SIIQRMVLGDIADRDAIVEALETNRYNHITATYFLLAERILREK 
QEKEIQTRSASPSNIKAQFRQSWPTKIDVPQDIiEDDLTATPLSH 
ATVPQSPARAADSVLNGHRSKGLCDSAKKDDLPELAGPALSTVP 
PASLKPTASGRKCLFRVEED3EEDEEDKKPMSLSTQWLRRKPS 
VTNRLTSRKSAPVLMQIFEEGESDDEFDMDENLPPKLSRIiKMNI 
ASPGTVHKRYHRRKSQGRGSSCSSSETSDDDSESRRRLDKDSGF 
TYSWHRRDSSEGPPGSEGDGGGQSKPSNASGGVDKASPSENNAG 
GGSPSSGSGGNPTNTSGTTRRCAGPS1TSMQLASRSAGELVESLK 
LMSLCLGSQLHGSTKYIIDPQNGLSFSSVKVQEKSTWKMCISST 
GNAGQVPAVGGIKFFSDHMADTTTELERIKSKNLKNNVLQLPIjC 
EKTISVNIQRNPKEGLLCASSPASCCHVT: 


5531 


24 


515 


GSQPRAPRPRDSMERPEPELIRQSWRAVSRSPLEHGTVLFARLF 
ALEPDLLPLFQYNCRQFSSPEDCLSSPEFLDHIRKVMLVIDAAV 
TKVEDLSSLEEYLASLGRKHRAVGVKLSSFSTVGESLLYMLEKC 
LGPAFTPATRAAWSQLYGAWQAMSRGWDGE 


5532 


3395 


14 02 


S DWMWGKRKMII EDETEFCGEELLHSVLQCKSVFDVLDGEEMR 
RARTRANPYEMIRGVFFDNRAAMKMAMMDFVFDRMFTNPRDSYG 
KPLVKDREAELLYFADVCAGPGGFSEYVLWRKKWHAKGFGMTLK 
GPNDFKLEDFYSASSELFEPYYGSGGIDGDGDITRPSNISAFRN 
FVLDNTDRKGVHFLI4ADGGFSVEGQENLQE1LSKQLLLCQFLMA 
LSIVRTGGHFICKTFDLFTPFSVGLVYLLYCCFERVCLFKPITS 
RPANSERYWCKGLKVGIDDVRDYLFAVNIKLNQLRNTDSDWL 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corre sponding 
to first 

sequence 


Predicted end 

location 
corresponding 
to first 
amino acid 

amino acid 


Ammo acid segment containing- signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, P=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=I,eucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 

W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








WPLSVIKGDHEFTDYMIESNESHCSLQIKALAKIHAFVQDTTL 
SEPRQAEIRKECLRLKGIPDQARVAPSSSDPKSKFFELIQGTEI 

YTWDGRQSDRWI KLDLKTELPRDTLLSVE IVHELKGEGKAQRKI 
SAIHILDVLVLNGTDVREQHFNQRIQIiAEKFVKAVSKPHRPDMN 
PIRVKEVYRLEEMEKIFVELEMKIIKGSSGTPKLSYTGRDDRHF 
VPMGLYIVRTVNEPWTMGFSKSFKKKFFYNKKTKDSTFDIjPADS 
iapfhicyygrlfwewgdgirvhdsqkpqdqdklskedvlsfiq 


5533 


94 


78S 


MKBRRAPQPWARCKLVLVGDVQCGKTAMLQVLAKDCYPETYVP 
TVFENYTACLETEEQRVELSLWDTSGSPYYDJWRPLCYSDSDAV 
LLCFDISRPETVDSALKKWRTEIT.DYCPSTRVLLIGCKTDLRTD 
LSTLMELSHQKQAPISYEQGCAIAKQLGPEIYLEGSAFTSEK3I 

FKKEKAKXCSIM 


5534 


3 


605 


LVKGRARAANPGRVGAMDGLRQRVEHFLEQRNLVTEVLGALEAK 
TGVEKRYLAAGAVTLLSLYLLFGYGASLLCNLIGFVYPAYASIK 
AIESPSKDDDTWLTYWVVYALFGLAEFFSDLLLSWFPFYYVGK 

RALDAAAGITRNVKPSQTPQPKDK 


5535 


1029 


332 


KSFMDSEARLCSLVELSDTQDETQKSDSENEDLKIDCLQESQBL 
NLQKLKNSERILTEAKQKMRELTVNIKMKEDLIKEIjIKTGNDAK 
SVSKQYTLKVTKLEHDAEQAKVELTETQKQLQELENKDLSDVAM 
KVKLQKEFRKKVDAAKLRVQVbOKKQQDSKKLASLSIQNEKRAN 
ELEQSVDHMKYQKIQLQRKLQEENEKRKQLDAVIKRDQQKIKVI 
LSYIPAKYNMKC 


5535 


942 


282 


AAATAASLSPEGCRLRTPSSDVSPSRAPPPSAAPLPTGRAQMSP 
SGRLCLLTIVGLILPTRGQTLKDTTSSSSADATIMDiaVPTRAP 
DAVYTELQPTSPTPTWPADETPQFQTQTQQLEGTDGPIjVTDPET 
HKSTKAAHPTDDTTTLSERPSPSTDVQTDPQTLKPSGFHEDDPF 
FYDEHTLRKRGLLVAAVLFITGIIILTSGKCRQLSRLCRNHCR 


5537 




2331 


RARVSS P QLRV FRSGRPRRLRVLR I NRTS VALRLAGTGR F VAKT 
PGHPGSWEMGLLTFRDVAVEFSLEEWEHLEPAQKNLYQDVMLEN 
YRNLVSLGLWSKPDLITFLEQRKEPWNVKSEETVAIQPDVFSH 
YNKDLLTEHCTEAS FQKVI SRRHGS CDLENLHLRKRWKREECEG 
HNGCYDEKTFKYDQFDESSVESLFHQQIL3SCAKSYNFDQYRKV 
FTHSSLLNQQEEIDIWGKEHIYDKTSVLFRQVSTLNSYRNVFIG 

MHGQEKQEQSYKCNKCVEVCTQSLKHIQHQTIHIRENSYSYNKY 
DKDLSQSSNLRKQIIHNEEKPYKCEKCGDSLNHSLHLTQHQIIP 
TEEKPYKWKECGKVFNLNCSLYLTKQQQIDTGENLYKCKACSKS 
FTFJ5SNLIVHQRIHTGEKPYKCKECGKAFRCSSYLTKHKRIH"G 
EKPYKCKECGKAFNRS3CLTQHQTTHTGEXLYKCKVCSKSYARS 

KCKVCAKPFTCFSNLIVHERIHTGEKPYKCKECGKAFPYSSHLI 
RHHRIHTGEKPYKCKACSKSFSDSSGLTVHRRTHTGEKPYTCKE 
CGKAFSYSSDVIQHRRIHTGQRPYKCEECGKAFNYRSYLTTHQR 
SHTGERPYKCEECGKAFNSRSYLTTHRRRHTGERPYKCDECGKA 
FSYRSYLTTHRRSHSG3RPYXCEECGKAFNSRSYLIAHQRSHTR 
EKL 


5538 


92S 




. IPWGSIPVLMjLLLLGLIDISQAQLSCTGPPAIPGIPG " 
IPGTPC-PDGQPGTPGIXGEKGLPGLAGDHGEFGEKGDPGIPGNP 
GKVGPKGPMGPKGGPGAPGAPGPKGESGDYKATQKIAFSATRTI 
NVPLRRDQTIRFDHVITNMNWNYEPRSGKFTCKVPGLYYFTYHA 
SSRGNLCVNLMRGRERAQKWTFCDYAYKTFQVTTGGMVLKLEQ 
IEN 2A"Di i.LLGWEGANSIFSGFLLFPDMEA 


5539 


38 


1258 


HRGPSGAAAPGCALPRGQALEGPRSCRRPQPMARRYDELiPHYPG 
IVDGPAALASFPETVPAVPGPYGPHRPPQPLPPGLDSDGLKREK 
DEIYGHPLFPLLALVFEKCELATCSPHDGAGAGLGTPPGGDVCS 
SDSFNEDIAAFAKQVRSERPLFSSNPELDNLVIQAIQVLRFHLL 
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SEQ 
ID 
WO: 


Predicted 
beginning 
nucleotide 

corresponding 

residue of 
amino acid 
sequence 


Predicted end 

to first 
amino acid 
residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
<A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 

L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide delebion, 
\=possible nucleotide insertion) 








' 'l i r "'HJLCDNFCHRYITCLK3KMPIDLVIEDRDGGCREDFEDY 
PASCPSLPDQNNMWIRDHEDSGSVHLGTPGPSSGGLASQSGDNS 
SDQGDGLDTSVASPSSGGEDEDLDQERRRNKKRGIFPKVATNIM 
RAWL7QHLSKPYPSEEQKKQLAQDTGLTILQVNNWFINARRRIV 
QPMIDQSNRTGQGAAFSPEGQPIGGYCETQPHVAVRPPGSVGMS 
LNLEGEWHYL 


5540 


148 


1440 


PP1GAGAGVHARSPHPARRLPLTTAGVGGRAPDLLPTPWRQHRG 
PSGAAAPGCALPRGQALEGPRSCRRPQPMARRYDELPHYPGIVD 
GPAALASFPETVPAVPGPYGPHRPPQPLPPGLDSDGLKREKDEI 
YGHPLFPLLALVFEKCELATCSPRDGAGAGLGTPPGGDVCSSDS 
FKEDNTAFAKQVRSERPLFSSNPELDNLMIQAIQVLRFHLLELE 
Xr-XMPIDLVIEDRDGGCREDFEDYPASCPSLFDQNNIWIKDHED 
SGSVHLGTPGPSSGGLASQSGDNSSDQGVGLDTSVASPSSGGED 
EDLDQEPR^NKKRGIFPKVATNIMRAWLFQHLSKPYPSEEQKKQ 
LAQDTGLT I LQ VN1IWF I NARRR I VQPMI DQSNRTGQGAAFS PEG 
QPIGGYTETEPHVAFRAPASVGDEFGTRKEEWHYL 


5541 


148 


1440 


PPLGAGAGVHARSPHPARRLP1 TTi. DLLPl't U T 
PSGAAAPGCALPRGQALEGPRSCRRPQPMARRYDELPHYPGIVD 
GPAALASFPETVPAVPGPYGPHRPPQPLPPGLDSDGLKREKDEI 
YGHPLFPLLALVFEKCELATCSPRDGAGAGLGTPPGGDVCSSDS 
FNEDNTAFAKQVRSERPLFSSNPELDNLMIQAIQVLRFHLLELE 
KGKMPIDLVIEDRDGGCREDFEDYPASCPSLPDQNHIWIRDHED 
SGSVHLGTPGPSSGGLASQSGDNSSDCGVGLDTSVASPSSGGED 

LAQDTGLTILQVMNWFIWARRRIVQPKIDQSWRTGQGAAFSPEG 
QP 1 GGYTETEPHVAFRAPAS VGDE FGTRKEEWHYL 


5542 


148 




PPGGAGAGVHARSPHPARRI.PLTTAGVGGRAPDLLPTPWRQHP.G''' 

PSGAAAPGCALPRGQALEGPRSCRRPQPMARRYDELPHYPGIVD 

GPAALASFPETVPAVPGPYGPHRPPQPLPPGLDSDGLKREKDEI 

YGHPLFPLLALVFEKCELATCSPRDGAGAGLGTPPGGDVCSSDS 

FNEDNTAFAKQVRSERPLFSSNPELDNLMIQAIQVLRFHLLELE 

KGKMPIDLVIEDRDGGCREDFEDYPASCPSLPDQNNIWIRDHED 

SGSVHU3TPGPSSGGIASQSGDNSSDQGVGLDTSVASPSSGGED 

EDLDQEPRRNKKRGIFPKVATNIMRAWLFQHLSHPYPSEEQKKQ 

LAQDTGLT1LQVNNWFINARRRIVQPMIDQSNRTGQGAAF3PEG 

QP I GGYTETEPHVAFRAPAS VGDEFGTRKEEWHYL 


5543 


2405 


6S5 


RKVREQPWPLRTSEAVKTPALRPFPGPRGVSPFPKPDWGKSPAP~ 

KR ? FSDSGAFWS PERRPG VLEAPRRRPVPAS FRAVPPKPTRVHG 

SSASRDRVLARTMIVADSECRAELKDYLRFAPGGVGDSGPGEEQ 

KESRARRGPRGPSAFIPVEEVLREGAESLEQHLGLEALMSSGRV 

ENLAWMGLHPDYFTSFWRLHYLLLHTDGPLASSWRHYIAIMAA 

ARHQCSYLVGSHMAEFLQTGGDPEWLLGLERAPEKLRKLSEINK 

LLAHRPWLITKEHIQALLKTGEHTWSLAELIQALVLLTHCHSbS 

SFVFGCGILPEGDADGSPAPQAPTPPS3QSSPPSRDPLNHSGGF 

ESARDVEALMERMQQLQESLLRDEGTSQEEMESRFELEKSESLL 

VTPSADILEPSPHPDMLCFVEDPTFGYEDFTRRGAQAPPTFRAQ 

DYTWEDHGYSLIQRLYPEGGQLLDEKFQAAYSLTYNTIAMHSGV 

DTSVLRRAIWNYIHGVFGIRYDDYDYGEVNQLLERNLKVYIKTV 

ACYPEKTTRRMYNLFWHHFRHSEKVIIVNLLLLEARMQAALLYAr. 

RAITRYMT 


5544 


1895 




LGGLL.GRQRLLIiRMGAGRLGAPMERHGRASATSVSSAGEQAAGD 
PEGRRQEPLRRRASSASVPAVGASAEGTRRDRLGSYSGPTSVSR 
QRVSSLRKKRPIiFPWFGLDIGGXLVKLVYFEPKDITAEEEEEEV 
ESLKSIRKYLTSNVAYGSTGIRDVHLELKDLTLCGRKGNLHFIR 
FPTHDMPAFIQMGRDKNFSSLHTVFCATGGGAYKFEQDFbTIGD 
LQLCKLDELDCLIKGILYIDSVGFNGRSQCYYFENPADSEKCQK 
LPFDLKNPYPLLLVNIGSGVSILiAVYSKDNYKRVTGTSLGGGTF 
>1 GLLT3CTTFEEALEMASRGDSTKVDKLVHD1YGGDYERFG 
LPGWAVASSFGNMMSKEKREAVSKEDLARATLITITNNIGS1AR 
MCALNENrNQWPVGNFLRINTIAMRLLAYALDYWSKGSLKALF 
SEHEGYFGAVGALLELLKIP 
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corresponding 
to first 

residue of 
amino acid 



Predicted end 
nucleotide 

corresponding 
amino acid 



Amino acid segment containing signal peptide 
[A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 



GANMSAGRGGAAWPVLLGLLLALLVPGGGA&KTGAELVTCGSVL 
KLLNTHHRVRLHSHDIKYGSGSGQQSVTGVEASDDANSYNRIRG 
GSEGGCPRGSPVRCGQAVRLTHVLTGKNLHTHHFPSPLSNNQEV 
SAFGEDGEGDDLDLWTVRCSGQHWEREAAVRFQHVGTSVFLSVT 
GEQYGS PIRGQHEVHGMPSANTHNTWKAMEGI FIKPSVE PSAGH 
DEL 

fvprgghssT^gqsgrspSqkraraqaqlrnleayaanphsfvft 
rgctgrnirqlsldvrrvmepltasrlqvrkknslkdcvavagp 
lgvthflilsktetnvyfklmrlpggptltfqvkkyelvrdws 
sijrrhrmheqqfa-lppllvlinsfgphgmhvklmatmfqnlfpsi 
nvhkvnlntikrcllidyijpdsqeldfrhysikvvpvgasrgmk 
kllqekfpmmsrlqdisellatgaglseseaepdgdhnitelpq 
avagrgnmraqqsavrlteigprmtlqlikvqegvgegkvmfhs 
fvs kteeelqai leakekklrlkaqrqaqqaqnvqrkqeqreah 
rkkslegmkkarvggsdeeas3ipsrtaslelgeddeeqedddi 
eyfcqavgeapsedlfpeakqkrlakspgrkrkrwemdrgrgrl 

iRPGKRVA 



fc'VPRGUHSKM ~» i "^i^ANPIISFVFT 

RGCTGRNIRQLSLDVRRVMEPLTASRLQVRKKNSLKDCVAVAGP 
LGVTHFLILSKTETNVYFKLMRLPGGPTLTFQVKKYSLVRDWS 
SLRRHRMHEQQFAHPPLLVLNSFGPHGMHVKLIVL^TMFQNLFPSI 
NVHKVNLNTIKRCLLIDYNPDSQELDFRHYSIKWPVGASRGMK 



DQTGPPETIAF TPPR5TMEPLCPLLLVGFSLPLARALRGMETTA 
DSNETTTTSGPPDPGASQPLLAWLLLPLLLLLLVLLLAAYFFRF 
RKQRKAWSTSDKKMPNGILEEQEQQRVMLLSRSPSGPKKYFPI 
PVEHLEEEIRIRSADDCKQFREEFNSLPSGHIQGTFELANKEEN 
REKNRYPNILPNDHSRVILSQLDGIPCSDYINASYI0GYKEJOJK 
FIAAQGPKQETVNDFWRMVWEQKSATIVMLTNLKERKEEKCHQY 
WPDQGCWTYGNIRVCVEDCWLVEYTIRKFCIQPQLPDGCKAPR 
LVSQLHFTSWPDFGVPFTPIGMLKFLKKVKTLNPVHAGPIWHC 
SAGVGRTGTFIVIDAMMAMMHAEQKVDVFEFVSRIRNQRPQMVQ 
TDMQYTFIYQALLEYYLYGDTELDVSSLEKHLQTMHGTTTHFDK 
I GLEEEFRKLTNVRIMKENMRTGNLPANMKKARVIQI I PYDFNR 
VILSMKRGQE YTDY INAS F I DGYRQKDYFIATQGPLAHTVEDFW 
RM I WEWKSHTI VMLTEVQEREQDKCYQYWPTEGSVTHGE IT I E I 
KNDTLSEAISIRDFLVTLNQPQARQEEQVRWRQFHFHGWPEIG 
I PAEGKGMIDLIAAVQKQQQQTGNHP I TVHCSAGAGRTGTF IAL 
SNILERVKAEGLLDVFQAVKSIiRLQRPHMVQTLEQYEFCYKWQ 
DFIDIFSDYAKFK 



FEATGGKRLAFKMAGTARKDREMAIQAKKKLTTATDPIERLRLQ 
CLARGSAGIKC-LGRVFRIMDDDNNRTLDFKEFMKGLNDYAWME 
KEEVEELFQRFDKDGNGTIDFNEFLLTLRPPMSRARKEVIMQAF 
RKLDKTGDGVITIEDLREVYNAKHHPKYQNGEWSEEQVFRKFLD 
NFDSPYDKDGLVTPEEFMNYYAGVSAS IDTDVYFI 1MMRTAWKL 



rPKVPESYVETSASGGTV 
SLIAFTTWALLTIMEFSVYQDTWMKYEYEVDKDFSSKLRINIDI 
TVftMKCQYVGADVLDrjAETMVASADGLVYEPTVFDI.SPQQKEWQ 
RMLQLIQSRLQEEHSL3DVIFKSAFKSTSTALPPREDDSSQSPN 
ACRIHGHL YVNKVAGNFH I T VGKAI PH PRGHAHLAAL VNHE S YN 
7GHRIDHLSFGELVPAIINPLDGTEK1AIDIINQMFQYFITWPT 
KLHTYKI SADTHQFS VTERER I INHAAGSHGVSGIFMKYDLSSL 
MVTVTEEHMPFWQFFVRLCGIVGGIFSTTGMLHGIGKFIVEIIC 

CRFRLGSYKPWSVPFBDGHTDNHL-PLLBNHTH 

KQRDHTMDYKESCPSVSIPSSDEHREKKKRFTVYKVLVSVGRSE 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 

amino acid 

amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 

amino acid 

amino acid 


Ammo acic segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, 3= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=?hreonine, V»Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








I E FYAEFDKLYNTLKKQFPAMALKIPAKRIFGDNFEPDFIK 
QRRAGLNEFIQNLVRYPELYNHPDVRAFLQMDSPKHQSDPSEDE 
D3RSSQKLHSTSQNINLGPSGHPHAKPTDFDFLPCVIGKGSFGKV 
LLAKRKLDGKFY71VKVLQKKIV1.NRKEQKHIMAERNVLLKNVKH 
PFLVGLHYSFQTTEKLYFVLDFVNGGELFFHLQRERSFPEHRAR 
FYAAEIASALGYLHSIKIVYEDLKPENILLDSVGHWLTDFGLC 
KEGIAISDTTTTFCGTPEYLAPEVIRKQPYUNTVDWWCLGAVLY 
EMLYGLPPFYCRDVAEMYDNTLHKPLSLRPGVSLTAWSILEELL 
EKDRQNRLGAKEDFLEIQNHPFFESLSWADLVQKKIPPPFNPNV 
AGPDDIPJIFDTAFTEETVPYSVCVSSDYSXVNASVLEADDAFVG 
FSYAPPSEDLFL 


5552 


2 74 8 


930 


LGPAAGAAI1 i u H £ r|t .SSYEEYADKPLEKPLKLVLKVGG 
SEVTELSGSGHDSSYYDDRSDHERERHKEKKKKKKKKSEKEKHL 
DDEERRKRKEEKKRKREREHCDTiiGKADDFDPGKKVEVEPPPDR 
PVRACRTQPASNESTPIQQLLEHFLRQLQRIOPHGFFAFPVTDA 
IAPGYSMI I KHPMDFGTMKDKI VANEYKSVTEFKADFKLMCDNA 
MTYNRPDTVYYKLAKKILHAGFKI-IMSKQAALLGNEDTAVEEPVP 
EWPVQVETAKKSKKPSREVISCMFEPEGNACSLTDSTAEEHVL 
ALVEHAADEARDRINRFLPGGKMGYLKRNGDGSLLYSWNTAEP 
DADEEETHPVDLSSLSSKLLPGFTTLGFKDERRNKVTFLSSATT 
ALSMQNNSVFGDLKSDEMELLYSAYGDETGVQCALSLQEFVKDA 
GSYSKICWDDLLDQITGGDH£RTLFQLKQRRNVPMKPPD3AKVG 
DTLGDSS S S VLEFMSMKS YPDVSVDI SMLS SLGKVKKELDPDDS 
HLNLDETTKLLQDLHEAQAERGGSRPSSNLSSLSNASERDQHHL 


GSPSRLSVGEQPDVTHDPYEFLQSPEPAASAKT 


5553 




1095 


I JREA YL* 3RMDGPVAEHAKQEPFHVV1 L_, 3 I 
VFLKCENVQPSGSFKIRGIGHFCQEMAKKGCRHLVCSSGGNAGI 
AAAYAARKLGIPATIVLPESTSLQWQRLQGEGAEVQLTGKVWD 
EANLRAQELAKRDGWENVPPFDHPLIWKGHASLVQELKAVLRTP 
PGALVIJWGGGGLLAGWAGLLEVGWQHVP1IAMETKGAHCFNA 
AITAGKLVTLPD1TSVAKSLGAKTVAARALECMQVCKIHSEWE 
DTEAVSAVQQLLDDERMLVEPACGAALAAIYSGLLRRLQAEGCL 
PPSLTSVWIVCG3NNINSRELQALKTHI.GQV 


5554 


166 


2318 


CSGRTGGRGSLRPAENVCLTCKLSGAETRGLLCPALRTWIMKVL 
3RSFFWVLFPVLPWAVQAVEHEEVAQRVIKLHRGRGVAAMQSRQ 
WVRDSCRKLSGLLRQKNAVLWKLKTAIGAVEKDVGLSDEEKLFQ 
VHTFEIFQKELNESENSVFQAVYGLQRALQGDYKDWNMKESSR 
QRLEALREAAIKEETEYMELLAAEKHQVEALKNMQHQNQSLSML 
DEILEDVRKAADRLEEEIEEHAFDDNKSVKGVNFEAVLRVEEEE 
ANS XQ1J I TKRE VEDDLGLS ML IDS QNNQ YILTKPRDSTI PRADH 
HFIKDIVTIGMLSLPCGWLCTAIGLPTMF3YIICGVLLGPSGLN 
3IKSIVQVETLGEFGVFFTLFLVGLEFSPEKLRKVWKISLQGPC 
YMTLLMIAFGLLWGHLLRIKPTQSVFISTCLSLSSTPLVSRFLM 
GSARGDKE 3D 1 D YS T VLLGMLVTQDVQLGLFMAVMPTL IQAGAS 
ASSSIWEVLRILVLIGQILFSLAAVFLLCLVIKKYLIGPYYRK 
LH ! £ LGi>IKS I L ILGISAFI FLMLTVTELLDVSMELGCFLAGAL 
VSSQGPWTEEIATSIEPIRDFDAIVFFA8IGLHVFPTFVAYEL 
TVLVFLTLSWVMKFLLAALVLSLILPRSSQYIKWIVSAGLAQV 
S3FSFVLGSRARRAGVISREVYLLILSVTTLSLLLAPVLWRAAI 
TRCVPRPERRSSL 


5555 


212 




LSLRTRETPAPPRCEAASQGRVGWRADAAAEEAVRSVWNRTRDR 
•:,^>:-itCOrLSrFCLLLLYLIGAVIAGRDFYKrLGVPRSASIKDIK 
KAYRKLALQLHPDRNPDDPQAQEKFQDLGAAYEVLSDSEKRKQY 
i bC^ <DGHQSSHGDIFSHFFGDFGFMFGGTPRQQDRNIPR 
GS D I IVDLEVTLEEVYAGNFVEWRNKPVARQAPGKRKCNCRQE 
MRTTQLGPGHFQMTQEVGCDECPOTKLWEERTLEVEIEPGVRD 
GMEYPFIGEGEPHVDG3PGDLRFRIKWKHPIPERRGDDLYTNV 
TISLVESLVGFEMDITHLDGHKVHISRDKITRPGAKLWKKGEGL 
PNFDNNWIKGSLIITFDVDFPKEQLTEEAREGIKQLLKQGSVQK 
VYNGLQGY 




5835 


3346 


RTRGMS KUCVPME FEE YLLRMFQGTFYLLQKI TKDNNAHTVKSR 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LEELDESYIEKFTDFLRLFVSVHLRRIESYSQFPWEFLTLLFK 
YTFHQPTHEGYFSCLDIWTLFLDYLTSKIKSRLGDKEAVLNRYE 
DALVLLLTEVLNRIQFRYNCAQLEELDDETLDDDQQTEWQRYLR 
QSLEWAKVMELLPTHAFSTLFPVLQDNLEVYLGLQQFIVTSGS 
GHRLNITAENDCRRLHCSLRDLSSLLQAVGRLAEYFIGDVFAAR 
FNDALTWERLVKVTLYGSC I KLYNI ETAVP S VLKPDL IDVHAQ 
SLAALQAYSHWLAQYCSEVHRQNTQQFVTLISTTMDAITPLIST 
KVQDKLLLSACHLLVSLATTVRPVFLISIPAVQKVFNRITDASA 
LRLVDKAQVLVCRALSNI LLLPWPNLPENEQQWPVRS INHASLI 
SALSRDYRNLKPSAVAPQRKMPLDDTKLIIHQTLSVLEDIVENI 
SGESTKSP.QICYQSLQESVCVSIALFPAFIHQSDVTDEML3FFL 
TLFRGLRVQMGVPFTEQIIQTFLNMFTREQLAESILHEGSTGCR 
WEKFLKI LQVWQEPGQVFKPFLPS I IALCMEQVYP I IAERPS 
PDVKAELFELLFRTLHHNWRYFFKSTVLASVQRGIAEEQMENEP 
QFSAIMQAFGQSFLQPDIHLFKQNLFYLETLNTKQKLYHKKIFR 
TAMLFQFVNVLLQ.VLVHKSHDLLQEE IG IAI YNMAS VDFDGFFA 
AFLPEFLTSCDGVDANQKSVLGRNFKMDRVRRER3RAKRRAEWA 
RKPGTCAARRGHIEASGRGLCPPCSLAAAHEMPADLVL 


5S57 


1712 


491 


VILGAGLRDKDMWIPWGLPRRLRLSALAGAGRFCILGSEAATR 
KHLPARNHCGJ,SDSSPQLWPF,PDFRNPFRKASKASLDFKRYVTD 
RRIAETLAQIYL3KPSRPPHLLLECNPGPGILTQALLEAGAKW 
ALESDKTFIPHLESLGKNLDGKLRVIHCDFFKLDPRSGGVIKPP 
AMSSRGLFKNLGIEAVPWTADIPLKWGMFPSRGEKRALWXLAY 
DLYSCTSIYKFGRIEVNMFIGEKEFQKLMADPGNPDLYHVLSVI 
KQIACEIKVLHMEPWSSFDIYTRKGPLENPKRRELLDQLQQKLY 
L1QMIPRQNLFTKNLTPMNYNIFFHLLKHCFGRRSATVIDHLRS 

KWLYDETLEDR 


5558 


1509 


95 


RAGCTHPQVPADLGAPAEPRRPQKTCVCLLQPQPGGQRGPTTMI 
TGVFSMRLWTPVGVLTSLAYCLHQRRVALAELQEADGQCPVDRS 
LLKLKMVQWFRHGARSPLKPLPLEEQVEWNPQLLEVPPQTQFD 
YTVTNLAGGPKPYSPYDSQYHETTLKGGMFAGQLTKVGMQQMFA 
LGERLRKNYVEDIPFLSPTFNPQEVFIRSTNIFRNLESTRCLLA 
GLFQCQKEGPIIIHTDEADSEVLYPNYQSCWSLRQRTRGRRQTA 
SLQPGISEDLKKVKDRMGIDSSDKVDFFILLDNVAAEQAHNLPS 
CPMLKRFARMI EQRAVDTSLY I LPKEDRES LQMAVGPFLHILES 
NLLKAMDSATAPDKIRKLYLYAAHDVTFIPLLMTLGIFDHKWPP 
FAVDLTMELYQH1ESKEWFVQLYYHGKEQVPRGCPDGLCPLDMF 
LNAMSVYTLSPEXYHALCSQTQVMEVGNEE 


5559 


150 


1983 


PLAATAHFAKMSRVAKYRRQVSEDPDIDSLLETLSPEEMEELEK 
ELDVVDPDGSVPVGLRQRNQTEKQSTGVYNREAMLNFCEKETKK 
LMQREM3MDESKQVETKTDAKNGEERGRDASKKALGPRRDSDLG 
KB P KRGGLKKS FSRDRDEAGGKSGEKPKEEKI IRG IDKGRVRAA 
VDKKEAGKDGRGEERAVATKECEEEKKGSDRNTGIiSRDKDKKREE 
MKEVAKKEDDEKVKGERRNTDTRKEGEKMKRAGGNTDMKKEDEK 
VKRGTGNTDTKKDDEKVKKNEPLHEKEAKDDSKTKTPEKQTPSG 
PTKPSEGPAKVEEEAAPSIFDEPLERVKNNDPEMTEVNVNNSDC 
I TNE I LVT^FTEALEFNTVVKLFALANTRADDHVAFAI AIMLKAN 
KTITSLNLDSNHIT3KGILAIFRALLQNNTLTELRFHNQRHICG 
GKTEMEIAKLLKENTTLLKLGYHFELAGPRMTVTNLLSRNMDKQ 
RQKRLQEQRQAQEAKGEKKDLLEVPKAGAVAKGSPKPSPQPSPK 
PSPKNSPKKGGAPAAPPPPPPPLAPPLIMENLKNSLSPATQRKM 
GDKVLPAQEKNSRDQLLAAIRSSNLKQLKKVEVPKLLQ 


S560 


9 


921 


• 'jFSALSVSMACLSPSQLQKFQQDGFLVLEGFLSAEECVAM 

qqrigeivaemdvplhcrtefstqeeeqlraqgstdyflssgdk 
irfffekgvfdekgnflvppeksinkighalhahdpvfksiths 
fkvqtlarslglqmpvwqsmyifkqphfggevsphqdasflyt 
eplgrvlgvwiavedatlengclwfipg3htsgvsrrmvrapvg 
sapgtsflgsepardnslfvptpvqrgalvlihgevvhkskq.nl 

S DRS RQAYT FHLMEASGTTWS PENWLQPTAELP FPQLYT 


55S1 


2175 


1775 


CYF1FQFFSSPYPGLHPHQTPAPLPNPGLYPPPVSMSPGQPPPQ 
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ID 
NO: 


beginning 

corresponding 

to first 
amino aoid 

sequence 


nucleotide 
corresponding 
amino acid 


(A=Alanine, C=Cystaine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Prolir.e, Q=Glutamine , R=Arginine, 

W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 

\=possible nucleotide insertion) 








qllaptyfsapgvmnfgnpsypyapgaleppppphlypntqaps 
qvyggvtyynpaqqqvqp:<psppertpqpvt;kppppewsrgs 
s 


5562 


342 


1385 


_b > I JiAAGAAGLVRGLKAGVLSQADYLNLVQC'ETLEDLKLH 
LQSTDYGNFLANEASPLTVSVTDDRLXEKMWEFRHMRNHAYEP 
LASFLDFITYSYMIDNVILLITGTLHQRSIAELVPKCHPLGSFE 

IRNTLYKAYLES FYKFCTLLGGTTADAMCP I LE FEADRRAFI IT 
INSFGTELSKEDRAKLFPHCGRLYPEGLAQLARADDYEQVKNVA 
DYYPEYKLLFEGA3SNPGDKTLEDRFFEHEVKLNKLAFLNQFHF 
GVFYAFVKLKEQECRNIVWIAECIAQRHRAKIDNYIPIF 


5563 


342 


1385 


S.5GKNDMAAAGAAGLVRGL.KAGVLSQADYLNLVQCETI,EDLKLH 
LQSTDYGNFLANEASPLTVSVIDDRLKEKMWEFRHMRNHAYEP 

QMBAVWIAQTPAELYNAILVDTPLAAFFQDCISEQDLDEMNIEI 
IRNTLYKAYLES F YKFCTLLG3TTADAMCPI LE FEADRRAF I IT 
INSFGTELSKEDRAKLFPHCGRLYPEGLAQLARADDYEQVKNVA 
DYYPEYKLLFEGAGSNPGDKTLEDRFFKHEVKLNKLAFLNQFHF 
GVFYAFVKLKF.QF.CRNIWIAECIAQRHRAKIDNYIPIF 


5564 


3 


914 


D} IJ'AF J 1 " 1 I '3AKRTGAI 

VPLTWRCDRDLDCSDGSDEERCRIEPCTQKGQCPPPPGLPCPCT 
GVSDCSGGTDKKLRNCSRLACLAGELRCTLSrDCIPLTWRCDGH 
PDCPDSSDELGCGTNEILPEGDATTMGPPVTLESVTSLRKATTM 
GPPVTLESVPSVGNATSSSAGDQSGSPTAYGVIAAAAVLSASLV 
TATLLLLSWLRAQERLRPLGI.LVAMKESLLLSEQKTSLP 


55S5 


993 


13 8 


JPARAGSISRPQRAPG3VSAVAMTAAVFFGCAFIAFGPA 

NKDGPTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSIN 
PGETAPSMRLLAYVSGLGFGIMSGVFSFVNTLEDSLGPGTVGrH 
GDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVL 
LTHLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRS 
LKLCLLCQDKNFLLYNQRSR 


5566 


2043 


1232 


SHIQHHGRGAQAPVKMVSWMISRAWLVFGMLYPAYYSYKAVKT 
KNVKEYVRWMMYWIVFALYTVIETVADQTVAWFPLYYELKIAFV 
IWLLSPYTKGASLIYRKFLHPLLSSKEREIDDYIVQAKERGYET 
MVNFGRQGLNLAATAAVTAAVKSQGAITERLRSFSMHDLTTIQG 

EGPYSDNEMLTHKGPRRSQSMKSVKTTKGRKEVRYGSLKYKVKK 
RPQVYF 


55" 


1554 


233 


EFLGSGVSPDLANEDGLTALHQCCIDDFREMVQQLLEAGANINA 
CDSECRTPLHAAATCGHLHLVELLIASGANLLWNTDGNMPYDL 
CDDEQTLDCLETAMADRGITQDSIEAARAVPELRMLDDIRSRLQ 
AGADLEAPLDHGATLLHVAAANGFSEAAALLLEHRASLSAKDQD 
GWEP -PIAAAYWGQVPLVELLVAHGADLNAKSLMDETPLDVCGDE 
EVRAKLLELKHKHDALLRAQSRQRSLLRRRTSSAGSRGKWRRV 

PPPPEEDNPEWRPHNGRVGGSPVRHLYSKRLDRSVSYQLSPLD 
STTPHTLVHDKAHHTLADLKRQRAAAKLQRPPPEGPE3PETAEP 
GLPGDTVTPQPDCGFRAGGDPPLLKLTAPAVEAPVERRPCCLLM 


5568 


1731 


587 


AEDxCPASRRGAGTTAAMAASGPGCRSWCLCPEVPSATFFTALL 
- PRLFLLQQPLAPSGLTLKSEALRNWQVYRLVTYI FVYE 
NPISLLCGAIIIWRFAGNFERTVGTVRHCFFTVIFAIFSAIIFL 
SFEAVSSLSKL3EVEDARGFTPVAFAMLGVTTVRSRMRRALVFG 
MWPSVLVPWLLLGASWLIPQTSFLSNVCGLSIGLAYGLTYCYS 
IDLSERVALKLDQTFPFSLMRRISVFKYVSGSSAERRAAQSRKL 
NPVPGSYPTQSCHPHLSPSHPVSQTQHASGQKLASWPSCTPGHM 
PTLPPYQPASGLCYVQNHFGPNPTSSSVYPASAGTSLGIQPPTP 
VNSPGTVYSGALGTPGAAGSKESSRVPMP 


5569 


2 


83 5 | QTPCPLAWERGSRSEDISVPGQKPPTCSSFSGMDVGPSSLPHLG 
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SEQ 
ID 
NO: 


nucleotide 
sequence 


Predicted end 
nucleotide 

corre sponding 
to first 
amino acid 
residue of 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, VeValine, 
W=Tryptophan, Y=Tyrosine, X=Onknown, *=Stop 
Codon, /=possible nucleotide deletion, 








LXLLLLLLLLPLRGQANTGCYGIPGMPGLPGAPGKDGYDGLPGP 
KGEPGIPAIPGIRGPKGQKGEPGLPGHPGKNGPMGPPGMPGVPG 
PMGIPGEPGEEGRYKQKFQSVFTVTRQTHQPPAPNSLIRFNAVL 
TNPQGDYDTSTGKFTCKVPGLYYFVYHASHTANLCVLLYRSGVK 
WTFCGHTSKTNQVWSGGVLLRLQVGEEVWLAVWDYYDMVGIQG 


5570 


264 


946 


RDRRDRGGVATSTEEPARPRAPQSRGPGPVSQTGRGRERGGGDT 
MSSPSPGKRRMDTDWKLIESKHEVTILGGLNEFWKFYGPQGT 
PYEGGVWKVRVDLPDKYPFKSPSIGFMNKIFHPNIDEASGTVCL 
DVINQTWTALYDLTNIFESFDPQLLAYPNPIDPLNGDAAAMYLH 
3PEEYKQKIKEYIQKYATEEALKEQEEGTGDSSSESSMSDFSED 


5571 


2S4 


946 


RDRRDRGGVATSTEEPARPRAPQSRGPGPVSQTGRGRERGGGnr 
MSSPSPGKRRMDTDWKL1ESKHEVTILGGLNEFVVKFYGPQGT 
PYEGGWKVRVDLPDKYPFKS PS 1GFMNKI FHPN IDEASGTVCL 
D V INQTWTAL YDLTN I FES FLPQLLA YPNP I DPLNGDAAAM YLH 
RPEEYKQKIKEYIQKYATEEALKEQEEGTGDSSSESSMSDFS3D 
SACDMEL 


5572 


2802 


2085 


RTDYRTGIPGRRFRVMAAGDGDVKLGTLGSGSESSNDGGSESPG 
DAGAAAEGGGWAAAALALLTGGGEMLLNVALVALVLLGAYRLWV 
RWGRRGLGAGAGAGEESPATSLPKMKKRDFSLEQLRQYDGSRNP 
RILLAVNGKVFDVTKGSKFYGPAGPYGIFAGRDASRGLATFCLD 
XDALRDEYDDLSDLNAVOMESVREWEMQFKEKYDYVGRLLKPGE 
EPSEYTDEEDTXDHNKQD 


5573 


2562 


219 


VPARTPNAEDQGPEARAATATPCQSGGRRRAGEAAEDGVKMAAF 
SEMGVMPEIAQAVEEMDWLLPTDIQAESIPLILGGGDVLMAAET 
GSGKTGAFS IPVIQ I VYETLKDQQEGKKGKTT I KTGASVLNKWQ 
MNPYDRGSAFAIGSDGLCCQ5REVKEWHGCRATKGLMKGKHYYE 
VSCHDQGLCRVGWSTMQASLDLGTDKFGFGFC-GTGKKSHNKQFD 
NYGEEFTMHDTIGCYLDIDKGHVKFSKNGKDLGLAFEIPPHMKN 
QALFPACVLKNAELKFNFGEEEFKFPPKDGFVALSKAPDGYIVK 
SQHSGNAQVTQTKFLPNAPK.O,LIVEPSRELAEQTLNNIKQFKKY 
IDNPKLRELLIIGGVAARDQLSVLENGVDIWGTPGRLDDLVST 
GKLNLSQVRFLVLDEADGLLSQGYSDFINRMHNQIPQVTSDGKR 
LQVIVCSATLHSFDVKKLSEKIMHFPTWVDLKGEDSVPDTVHHV 
WPVNP KTDRLWERLGKS H I RTDDVHAKDNTRPGAKS PEMWS EA 
IKILKGEYAVRAIKEHKMDQAIIFCRTKIDCENLEQYFIQQGGG 
PDKKGHQFSCVCLHGDRKPHERKQNLERFKKGDVRFLICTDVAA 

TEKEKVWYHVCSSRGKGCYNTRLKEDGGCTIWYNEMQLLSEIEE 
HLNCTISQVEPD1KVPVDEFDGKVTYGQKRAAGGGSYKGHVDIL 
APTVQELAALEKEAQTSFLHLGYLPNQLFRTF 


5574 


1731 


352 


NEGLEVFKEQELQPEDKGAVP3DASTERSAMASLGLQLVGYILG 
LLGLLGTLVAKLLPSWKTSSYVGASIVTAVGFSKGLWMECATHS 
TGITQCDI YSTLLGLPAD IQAAQAMMVTSSAISSLACI ISWGM 
RCTVFCQESRAKDRVAVAGGVFFILGGLLGFIPVAWNLHGILRD 
FYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCF3CSCQRN 


5575 


45G 


766 


LLWALPCPPPTAAAVLLSSTGLMELLEKMLALTLAKADSPRTAL 
LCSAWLLTASFSAQQHKGSLQKDPLLSQACVGCLEALLDYLDAR 
SPDIGRNSPHYLMFP 


557G 


249 


2146 


RSWGAPWFWRKRLLRRRHMPLRLAMVGCAFVLFLFLLHRDVSSR 
EEATEKPWLKSLVSRKDHVLDLMLEAMEOTLRDSMPKLQIRAPEA 
QQTLFSINQSCLPGFYTPAELKPFWERPPQDPNAPGADGKAFQK 
SKWTPLETQEKEEGYKKHCFNAFASDRISLQRSLGPDTRPPECV 
DQKFRRCPPIATTSVIIVFHNEAWSTLLRTVYSVLHTTPAILLK 
EtlLVDDASTEEHLKEKLEQYVKQLQWRWRQEERKGLITARL 
LGASVAQA3VLTFLDAHCECFHGWLEPLLARIAEDKTWVSPDI 
VTIDLNTFEFAKPVQRGRVHSRGNFDWSLTFGWETLPPHEKQRR 
KDETYPIKSPTFAGGLFS1SKSYFEHIGTYDNQMEIWGGENVEM 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 
amino acid 
residue of 


Predicted end 
nucleotide 
location 
corresponding 
to first 

residue of 
amino acid 
sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=pcssible nucleotide insertion) 








S F RVK C CGGQLE 1 1 P CS WGHVFRTKS PHTFPKGTS VIARNQVR 
LAEVWMDSYKKIFYRRNLQAAKMAQEKSFGDISERLQLREQLHC 
HNFSKYLHNVYPEMFVPDLTPTFYGAIKWLGTNQCLDVGEKWRG 
GKPLIMYSCHGLGGNQYFEYTTQRDlRmilAKQLCLHVSKGALG 
LGSCHFTGKNSQVPKDEEWELAQDQLIRNSGSGTCLTSQDKKPA 
MAPCNPSDPHQLWLFV 


5577 


3 


1275 


RNSDCSCGEISVHCLPWVLF1LDLKVESSMFGPLKLILLPVLLD 
YSLGLNDLWSPPELTVHVGDSALMGCVFQSTEDKCIFKIDWTL 
SPGEHAKDEYVLYYYSNLSVPIGRFQNRVHMGDILCNDGSLLL 
QDVQEADQGTYICEIELK3ESQVFKKAWLWLPEEPKELMVHV 
GGLIQMGCVFQSTEVKHVTKVEWIFSGRRAKEEIVFRYYHKLRM 
S VS YSQ S WGHF QNRV1IL VGD I FRNDGS I MEQGVRESDGGNYTCS 
IHLGNLVFKKTIVLHVSPEEPRTLVTPAALRPLVLGGNQLVIIV 
GIVCATILLLPVLILIVKKTCGNKSSVNSTVLVKHTKKTNPEIK 
EKPCHFERCEGEKHIYSPIIVREVIEEEEPSEKSEATYMTMHPV 
DF i E TQOAF 


5578 


> 


783 


- <E IflbPGAGRAPPELPSRNCGYREVEYWUQRYQGAADSAPYD 
WFGDFSSFRAIiLEPELRPEDRILVLGCGNSALSYELFLGGFPNV 
TSVDYSSV'WAAMQARYA-IVPQLRWETMDVRKLDFPSASFDWL 
EKGTLDALLAGERDPWTVSSEGVHTVDQVLSEVSHVLVPGGRFI 
SMTSAAPHFRTRHYAQAYYGWSLRHATYGSGFHFHLYLMHKC-GK 
LSVAQLALGAQILSPPRPPTSPCFLQDSDHEDFIiSAIQL 


5579 


3 


1540 


RNSGLARGA3ALARHGGGLAGGVGWDCGACASRCQGVMEGLLTR 
CRALPALATCSRQLSGYVPCRFHHCAPRRGRRLLLSRVFQPQNL 
REDRVLSLQDKSDDLTCKSQRLMLQVGLIYPASPGCYHLLPYTV 
RAMEKLVRVIDQEMQAI GGQKVMMPB hS PAELWQATNRWDLMGK 
ELLRLRDRHGKEYCLGPTHEEAITALIASQKKLSYKQLPFLLYQ 
VTRKFRDEPRPRFGLLRGREFYMKDMYTFDSSPEAAQQTYSLVC 
DAYCSLFNKLGLPFVKVQADVGTIGGTVSHEFQLPVDIGEDRLA 
ICPRCSFEANMETLDLSQMNCPACQGPLTKTKGIEVGHTFYLGT 
KYSSIFNAQFTNVCGKPTLAEMGCYGLGVTRZLAAAIEVLSTED 
CVRWPSLLAPYQACLIPPKKGSKEQAASELIGQLYDKITEAVPQ 
LHGEVLLDDRTHLTIGNRLKDANKFGYPFVIIAGKRALEDPAHF 
EVWCQNTGEVAFLTKDGVMDLLTPVQTV 


5580 


1681 


« 


ADAGTRCIPGFWPSGAGYSAPAQRGRRSSGRMRAAAAPGLTAP 
WRLLQCCELEAGELGKAVPAAAMGPSALGQSGPGSMAPWCSVSS 
GPSRYVLGMQELFRGHSKTREFLAHSAKVHSVAWSCEGRRLASG 
SFDKTASVFLLEKDRLVKENNYRGHGDSVDQLCWHPSNPDLFVT 
ASGDKTIRIWDVRTTKCIATVNTKGENINICWSPDGQTIAVGNK 
DDWTFIDAKTHRSKAEEQFKFEYNEISWNNDNNMFFLTIIGISrGC 
INILSYPELKPVQSINAHPSNCICIKFDPMGKYFATGSADALVS 
LWDVDELVCVRCFSR1DWPVRTLSFSHDGKMLASASEDHFIDIA 
EVHTGDKLWEVQCESPTFTVAWHEKRPLLAFACDDKDGKYDSSR 
EAGTVKLFGLPNDS 


55ai 


54 


947 


GC-GSGPRAPSATLLDTGESVAAVASGEDKGIAASAAAAAVFACS 
CSPDPQSSTMNPVYSPVQPGAPYGNPKNMAYTGYPTAYPA&APA 
YNPSLYPTNSPSYAPEFQFLHSAYATLLMKQAWPQNSSSCGT3G 
TFHLPVDTGTENRTYCASSAAFRYTAGTPYKVPPTQSNTAPPPY 
SPSPHPYQTAMYPIRSAYPQQNLYAQGAYYTQPVYAAQPHVIHH 
TTWQPNSIPSAIYPAPVAAPRTNGVAMGMVAGTTMAMSAGTLL 
TTPQHTAIGAHPVSMPTYRAQGTPAYSYVPPHW 


5582 


5775 


2739 


1 1 TNNNNVI I PLVIAYHLSGSAQARGERS PAERLMERQKRKAD I 
3KGLQFIQ3TLPLKQEEYEAFLLKLVQNLFAEGNDLFREKDYKQ 
ALVQYMEGLNVADYAASDQVALPRELLCKLHVWRAACYFTMGLY 
■vOiLhiliSEKALGLDSESIRALFRKARALNELGRHKEAYECSSRC 
SLALPHDESVTQLGQELAQKLGLRVRKAYKRPQELETFSLLSKG 
TAAGVADQGTSNGLGSIDDIETDCYVDPRGSPALLPSTPTMPLF 
PHVLDLLAPLDSSRTLPSTDSLDDFSDGDVFGPELDTLLDSLSL 
VQGGLSGSGVPSELPQLI P VFPGGTPLLP P WGGS IPVSSPLPP 
ASFGL^DPSKKLAASVLDALDPPGPTLDPLDLLPYSETRLDAL 
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Predicted 
beginning 
lucleotide 
location 
corresponding 



:■ fir; 



Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 



Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, P=Phenylalanine , G=Glycine, 
H=Histidine, l=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonir_e, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 
DSF3STRGSLDKPDSFMEETNSQDHRPPSGAQKPAPSPEPCMPN 
TALLIKNPLAATHEFKQACQLCYPKTGPRAGDYTYREGLEHKCK 
RDI LLGRLRS SEDQTWKR I RPRPTKTS F VGS YYLCKDM INKQD C 
KYGDNCTFAYHQEEIDWTEERKGTLNRDLLFDPLGGVKRGSLT 
IAKLLKEHQG I FTFLCE I CFDSKPRI IS KGTKDSPS VCSNLAAK 
HSFYNNKCLVHI VRSTSLKYSKIRQFQEHFQFDVCRHEVRYGCL 
REDSCHFAHSFIELKVWLLQQYSGMTHEDIVQ3SKKYWQQMEAH 
AGKASSSMGAPRTHGPSTFDLQMKFVCGQCWRNGQWEPDKDLK 
YCSAKARHCWTKERRVLLVMS KAKRKWVSVRPLPS I RNFP0.QYD 
LCIHAQNGRKCQYVGNCSFAHSPEERDNWTFMXENKILDMQQTY 
DMW L KKHNPGKPGEGTP I S SREGEKQIQMPTDYAD I MMGYHCWL 
CGKNSNSKKQWQQHIQSEKHKEKVFT3DSDASGWAFRFPMGEFR 
LCDRLQKGKACPDGDKCRCAHGQEELNEWLDRREVLKQKLAKAR 
KDMLLCPRDDDFGKYNFLLQE D GDIAGATPEAPAAAATATTG E 
SSGCRQGRPGRSDRPRPPPRRHKMVKETRYYDILGVKPSASPEE 
I KKAYRKLALKYHPDKNPDEGE KFKL I SQAYE VLSDPKKRDVYD 
QGGEQAIKEGGSGSPSFSSPMDIFDMFFGGGGRMARERRGKNW 
HQLSVTLEDLYNGVTKKLAL2KNVICEKCEGVGGKKGSVEKCPL 
CKGRGMHimQQIGPGMVQQlQTVCISCKGQGERINBKDRCESC 
SGAKVIRLKKIIEVHVEKGMKDGQKIjFIIGEGDQRPELEPGDVI 
IVLDQKDHSVFQRR3HDLIMKMKIQLSEALCGFKKTIKTLDNRI 
LVITSKAGEVIKHGDLRCVRDEGMPIYKAPLEKGILIIQFLVIF 

pekhwlsleklpqleallpprqkvritddmdqvelkefcpnfion 
krqhrfaykkdengpqagvqcqta 
~ ^sgcrqgrpgrsdrl'rpppWikmvketryydilgvkpsas 7 
ikkayrklalkyhpdknpdegekfklisqayevlsdpkkrdvyd 

QGGEQAIKEGGSGSPSFSSPMDIFDMFFGGGGRMARERRGKNW 
BQhSVTLEVLYmVTKKLAhQKNVlCEKCEGVGGXKGSVBKCPL 
CKGRGMHIHIQQIGPGMVQQIQTVCIECKGQGERINPKDRCESC 
SGAKVIREKKIIEVHVEKGMKDGQKILFHGEGDQEPELEPGDVI 
IVLDQKDHSVFQRRGHDLIMKMKIQLSEALCGFKKTIKTLDNRI 
LVITSKAGEVIKHGDLRCVRDEGMPIYKAPLEKGILIIQFLVIF 
PEKHWLSLEKLPQLEALLPPRQKVRITDDMDQVELKEFCPNEQN 
/JRQHREAYEEDEDCPQAGVQCQTA 
LPAGTPESSLHEALDQCMTALDLFLTNQFSEALSYLKPRTKESM 
YHSLTYATILEMQAMMTFDPQDILLAGNMMKEAQ'MLCQRHRRKS 
SVTDSFSSLVNRPTLGQFTEEEIHAEVCYAKCLLQRAALTFLQD 
ENMVS FIK3GI KVRNS YQTYKELDSLVQS SQYCKGENHPHFEGG 
VKLGVGAFNLTLSMLPTRILRLLEFVGFSGNKDYGLLQLEEGAS 
GHSFRSVLCVMLLLCYHTFLTFVLGTGNVNIEEAEKLLKPYLNR 
YPKGAIFLFLAGRIEVIKGWIDAAIRRFEECCEAQQHWKQFHHM 
CYWELMWCFTYKGQWKMSYFYADLLSKENCWSKATYIYMKAAYlj 
3MFGKEDHKPFGDDEVELFRAVPGLKLKIAGKSLPTEKFA1RKS 
RRYFSSKPISLPVPALEMMYIWNGYAVIGKQPKLTDGILEIITK 
AEEMLEKGPENEYSVDDECLVKLLKGLCLKYLC-RVQEAEENFRS 
I SANE KK I K YDH YL I PNALLELALLLMEQDRNEEAI KLLESAKQ 
HYKNYSMESRTHFRIQAATLQAKSSLBNSSRSMVSSVSL 
1PAGTPES3LHEALDQCMTALDLFLTNQFSEALSYLKPRTKESM 
YHSLTYAT I LEMQAMMTFDPQD I LLAGMMMKEAQMLCQRHRRKS 
SVTDSFSSLVNRPTLGQFTEEEIRAEVCYAKCLLQRAArjTFLQD 
ENMVSF1KGGIKVRIISYQTYKELDSLVQSSQYCKGENHPHFEGG 
VKLGVGAFNLTLSMLPTRILRLLEFVGFSGNKDYGLLQLEEGAS 
GHSFRSVLCVMLLLCYHTFLTFVLGTGNVNIEEAEKLLKPYLNR 
YPKGAIFLFLAGRIEVIKGNIDAAIRRFEECCEAQQHWKQFHHM 
CYWELMWCFTYKGQWKMSYFYADLLSKENCWSKATYIYMKAAYL 
SMFGKEDHKPFGDDEVELFRAVPGLKLKIAGKSLPTEKFAIRKS 
RRYFSSNPISLPVPALEMMYIWNGYAVIGKQPKLTDGILEIITK 
AEEMLEKGPENEYSVDDECLVKLLKGLCLKYLGRVQEAEENFRS 
ISANEKKIKYDHYLIPWALLELALLLMEQDRNEEAIKLLESAKQ 
HYK^YSMESRTHFRIQAATLQAKSSLENSSRSMVSSVSL 
" SSAVPDGAVGRPVAVAVGGPPHSCRCRPCCLMAAIGVHLGCT 
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SEQ 
ID 
WO: 


Predicted 
beginning 

corresponding 
to first 
amino acid 

amino acid 


Predicted end 
nucleotide 

cor re sponding 
to first 
amino acid 
residue of 
amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








CVAVYXDGPJVGWANDAGDRVTPAWAYSENEEIVGIAAKQSRI 
RNISNTVMKVKQILGRSSSD3QAQKYIAESKCLVIEKNGKLRYE 
IDTGEETKFVNPEDVARLIFSKMKETAHSVLGSDANDWITVPF 
DFGEKQKNALGEAARAAGFNVLRLIHEPSAALLAYGIGQDSPTG 
KSNILVFKLGGTSLSLSVMEYNSGIYRVLSTNTDDNIGGAHFTE 
TLAQYLAS EFQRS FKHDVRGNARAMHKLTNSAEVAKHSLSTLGS 
ANCFLDSLYEGQDFDCNVSRARFELLCSPLFNKCIEAIRGLLDQ 
NGFTADDINKWLCGGSSRIPKLQQLIKDLFPAVELLNSIPPDE 
VIPIGAAIEAGILIGKENLLVEDSLMIECSARDILVKGVDESGA 
SRFTVLFPSGTPLPARRQHTLQAPGSISSVCLELYESDGKNSAK 
SETKFAQWLQDLDKKENGLRDILAVLTMKRDGSLHVTCTDQET 
GKCEA I S I E I AS 


5588 


3 


589 


TPPPPEQAMVAATVAAAWLLLWAAACAQQEQDFYDFKAVNIRGK 
LVSLEKYRGSVSIiV^/NVASECGFTDQHYRALQQLQRDLGPHHFM 
VLAFPCNQFGQQEPDSNKEIESFARRTYSVSFPMFSKIAVTGTG 
AHPAFK1" * i T ! M 1 l L , r i > _ kWGAWDPTVSVEBV 
RPOITALVRKLILLKREDL 


S589 - 


1884 


553 


LRQAWHEGGIGQTDKERGAAALPGEEGDPTRGKSLGRASWESGS 
PRRPRSPF3SFLPRPICLSLEARPCSIEDRRNKSLIGRPGAPAS 
GLNRSSGLWLGPDRCRPRSRCSCRVMENPSPAAALGKALCALLL 
ATLGAAGQPLGGESICSARAPAKYSITFTGKWSQTAFPKQYPLF 
RPPAQWSSLLGAAHSSDYSMWRKNQYVSNGLRDFAERGEAWALM 
KE1EAAGEALQSVHAVFSAPAVPSGTGQTSAELEVQRRHSLVSF 
WRIVPSPDWFVGVDSLDLCDGDRWREQAALDLYPYDAGTDSGF 
TFSSPNFATIPQDTVTEITSSSPSHPANSFYYPRLKALPPIARV 
TLLRLRQSPRAF1PPAPVLPSRDNEIVDSASVFETPLDCEVSLW 
SSWGLCGGHCGRLGTKSRTRYVRVQPANNGSPCPELSEEAECVP 
DNCV 


5590 


72 


896 


LCSSGALRLLPAMVAWRSAFLVCLAFSLATLVQRGSGDFDDFNL 
EDAVKETSSVKQPWDHTTTTTTNRPGTTRAPAKPPGSGLDLADA 
LDDQDDGRRKPGIGGRERWNHVTTTTKRPVTTRAPANTLGKDFD 
LADALDDRNDRDDGRRKPIAGGGGFSDKDLEDIVGGGEYKPDKG 
KGDGRYGSNDDPGSGMVAEPGTIAGVASALAMALIGAVSSYISY 
QQKKFCFSIQQGLNADYVKGENLEAWCEEPQVKYSTliHTQSAE 
PPPPPEPARI 


5591 


68 


1494 


AGSSRRAAAERLLVSAGCRSLAGRASGVLLLPAELLPGEEEAMA 
LRVTRNSKINAENKAKINMAGAKRVPTAPAATSKPGLRPRTALG 
DIGNKVSEQLQAKMPMKKEAKPSATGKVIDKKLPKPLEKVPMLV 
PVPVSEPVPEPEFEPEPEPVKEEKLSPEPILVDTASPSPMETSG 
CAPAEEDLCQAFSDVILAVWDVDAEDGADPNLCSEYVKDIYAYL 
RQLEEEQAVRPKYLLGREVTGNMRAILIDWLVQVQMKFRLLQET 
MYMTVS I IDRFMQNNCVPKKMLQLVGVTAMF IAS KYEEMYPPE I 
GDFAFVTDNTYTKHQIRQMEMKILRALNFGLGRPLPLHFLRRAS 
KIGEVDVEQHTLAKYLMELTMLDYDMVHFPPSQIAAGAFCLALK 
ILDNGEWTPTLQHYLSYTEESLLPVMQHLAKNAAMWQGLTKHM 
TVKNKYATSKHAKI STLPQLNSALVQDLAKAVAKV 


5592 


242 


924 


YGES KDKNQKDIiLSALVLTTVNCLPTP IMAKSAEVKLAI FGRAG 
VGKSALWRFLTKRFIWEYDPTLESTYHHQATIDDEWSMEILD 
TAGQEDTIQREGHMRWGEGFVLVYDITDRGSFEEVLPLKNILDE 
I KK P KNVTL I r. VGNKADLDHS RQ VS TEEGEKLATELACAF YE CS 
ACTGEGNITEIFYELCREVRRRRMVQGKTRRRSSTTHVKQAINK 
MLTKISS 


5593 


3 




HASGGRAAWMAAERGAGQQQSQEMMEVDRRVESEESGDEEGKKH 
SSGIVADLSEQSLKDGEERGEEDPEEEHELPVDMETINLDRDAE 
DVDLNHYRIGKIEGFEVLKKVKTLCLRQNLIKCIEKLEELQSLR 
KLD1.YDNQIKKIENLEALTELEILDISFNLLRNIKGVDKLTRLK 
KLFLVNNKISKIENLSNLHQLQMLELGSNRIRAIENIDTLTNLE 
SLFLGKNKITKLQNLDALTNLTVLSMQSNRLTKIEGLQIILVNLR 
ELYLSHNGIEVIEGLENNNKLTMLDIASNRIKKIENISHLTELQ 
EFI-JMNDNLLESWSDLDELKGARSLETVYIjERNPIjQKDPQYRRKV 
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SEQ 


Predicted 


Predicted end 


Amino acid segment containing signal peptide 


ID 




nucleotide 


(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 


NO: 


nucleotide 

corresponding 
to first 


corre sponding 


Glutamic Acid, F= Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 




amino acid 




S=Serine, T=Threonine, V=Valine, 




residue of 




W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 








Codon, /=possible nucleotide deletion, 








\=possible nucleotide insertion) 








MIALPSVRQIDATFVRF 


5594 


3 


1113 


HASGGRAANMAA3RGAGQQQSQEMMEVDRRVESEESGDEEGKKH 
SSGIVADLSEQSLKDGEERGEEDPEESHELPVDMETINLDRDAE 
DVDLNHYRIGKIEGFEVLKKVKTLCL3QNLIKCIENLEELQSLR 
ELDLYDNQIKKIENLEALTELEILDISFNLLRNIEGVDKLTRLK 
KLFLVNNKISKIENLSNLHQLQMLELGSNRIRAI3WIDTLTWLE 
SLFLGKNKITKLQNLDALTNLTVLSMQSNRLTKISGLQNLOTLR 
ELYLSHNGIEVIEGLENNNKLTMLDIASNRIKKI3NISHLTELQ 
EFWMNDNLLESWSDLDELKGARSLETVYLERNPLQKDPQYRRKV 
MLALPSVRQIDATFVRF 


5595 


3 


14 7 5 


ARWNGRWVQVPAWPGPGCGTNASGERQRQLPRAWRPVGRTLGSE 
PIALAWSPPLYLFPIPLPSWAVSQPTPTLGTMFADLDYDIEEDK 
LGIPTVPGKVTLQKDAQNLIGISIGGGAQYCPCLYIVQVFDNTP 
AALDGTVAAGDEITGVNGRSIKGKTKVEVAKMIQEVKGEVTIHY 
MKLQADPKQGMSLDIVLKKVKHRLVENMSSGTADALGLSRAILC 
NDGLVKRLEELERTAELYKGMTEHTKNLLRAFYELSQTHRAFGD 
VFSVI GVREPQPAASEAFVKFADAHRS IEKFGI RLLKT I KPMLT 
DLNTYLNKAIPDTRLTIKKYLDVKFEYLSYCLKVKEMDDEEYSC 
IALGEPLYRVSTGNYBYRLILRCRQEARARFSQMRKDVT.EKHHu 
LDQKHVQDIVFQLQRLVSTM3KYYNDCYAVLRDADVFPIEVDLA 
HTTLAYGLNQEEFTDGEEEEEEEDTAAGEPSRDTRGAAGPLDKG 
GSWCDS 


559S 


698 


219 


gavlapsslpaaelaaqgesqsledlsntsrptsevykisfifp 
ngdkydgdctr'tssgiyerngigihttpngivytqswkddkmng 
fgrlehfsgavyegqfkdnmfhglgtytfpngakytgnfnenrv 
kgegeythiqgtrmdwtfhfrscsqr 


5597 


3 


731 


ISCKMAADGQSSLPASWRSVTT,THVF.YPARnr,RRHT,r,AYLSLSP 
VFVIVGFVTLIIFKRELHTISFLGGLALMEGVNWLIKNVIQEPR 
PCGGPHTAVGTKYGMPSSHSQFMWFFSVYSFLFLYLRMHQTNNA 
RFLDLLWRHVLSLGLLAVAFLVSYSRVYLLYHTWSQVLYGGIAG 
GLMA1AWFIFTQEVLTPLFPRIAAWPVSEFFLIRDTSLIPNVLW 
FEYTVTRAEARNRQRKLGTKLQ 


5590 


326 


2440 


GIGPIAASFIFCKVASLYIFLSPPPPSVSGVPYSPANSSWSCAL 
VPLLGSGVPPHPPAPSPCCSGQTMLKMLSFKLLLLAVALGFFEG 
DAKFGERNEGSGARRRRCLNGNPPKRLKRRDRRMMSQLELLSGG 
EMLCGGFYPRLSCCLRSDSPGLGRLENKIFSVTHWTECGKLLEE 
IKCALCSPHSQSLFHSPEREVLERDLVLPLLCKDYCKEFFYTCR 
GHIPGFLQTTADEFCFYYARKDGGLCFPDFPRKQVRGPASNYLD 
QMEEYDKVEEISRKHKHNCFCIQEWSGLRQPVGALHSGDGSQR 
LFILEKEGYVKILTPEGEIFKEPYLDIHKLVQSGIKGGDERGLL 
SLAFHPNYKKNGKLYVSYTTNQERWAIGPHDHILRWEYTVSRK 
NPHQVDLRTARVFLEVAELHRKHLGGQLLFGPDGFLYIILGDGM 
ITLDDMEEMDGLSDFTGSVLHLDVDTDMCNVPYSIPRSNPHFNS 
TNQPPEVFAHGLHDPGRCAVDRHPTDININLTILCSDSNGKNRS 
SARILQIIKGKDYESEPSLLEFKPFSNGPLVGGFVYRGCQSERL 
YGSYVFGDRNGNFLTLQQSPVTKQWQEKPLCLGTSGSCRGYFSG 
HILGFGEDELGEVYILSSSKSMTQTHNGKLYKIVDPKRPLMPEE 
CRATVQPAQTLTSECSRLCRNGYCTPTGKCCCSPGWEGDFCRTG 


5599 


326 


2440 


5 i AA3FIFCBVASLYIFLSPPPPSVSGVPYSPANSSWSCAL 
VPLLGSGVPPHPPAPSPCCSGQTMLKMLSFKLLLLAVALGFFEG 
DAKFGERNEGSGARRRRCLNGNPPKRLKRRDRRMMSQLELLSGG 

emlcggfyprlscclrsdspglgrlenkifsvtnntecgkllee 
i kcalcs phsqslfhs pere vlerdlvlpllckdyckeffytcr 
ghipgflqttadefcfyyarkdgglcfpdfprkqvrgpasnyld 
qmeeydkveeisrkhkhncfciqewsglrqpvgalhsgdgsqr 
lfilekegyvkiltpegeifkepyldihklvqsgikggdergll 
slafhpnykkngklyvsyttnqerwaigphdhilrweytvsrk 
nphqvdlrtarvflevaelhrkhlggqllfgpdgflyiilgdgm 
itlddmeemdglsdftgsvlrldvdtdmcnvpysiprsnphfns 
tt:cppevfahhlhdpgrcavdrhptdininltilcsdsngknrs 



349 



WO 01/53312 



PCT/US00/34263 



SEQ 
ID 
NO: 


Predicted 
beginning 

corresponding 
to first 

residue of 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 

residue of 
amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion} 








SARILQIIKGKDYESEPSLLEFKPFSNGPLVGGFVYRGCQSERL 
YGSYVFGDRNGNFLTLQQSPVTKQWQEKPLCLGTSGSCRGYFSG 
HILGFGEDELGEVYILSSSKSMTQTHNGKLYKIVDPKRPLMPEE 
CRATVQPAQTLTSECSRLCRNGYCTPTGKCCCSPGWEGDFCRTG 


5S00 


1977 


1244 


SLRVLSGHLMQTRDLVQPDKPASPKFIVTLDGVPSPPGYMSDQE 
EDMCFEGMKPVNQTAASNKGLRGLLHPQCLHLLSRQLEDPNGSF 
SXAEMSELSVAQKPEKLLERCKYWPACKNGDECAYHHPISPCKA 
FPNCKFAEKCLFVHENCKYDAKCTKPDCPFTHVSRRIPVLSPKP 
AVAPPAPPSSSQLCRYFPACKKMECPFYHPFJJCRFNTQCTRPDC 
TFYHPTINVPPRHALKWIRPQTSE 


5S01 


1977 


1244 


SLRVLSGHLMQTRDLVQPDKPASPKFIVTLDGVPSPPGYMSDQE 
EEMCFEGMKPVNQTAASNKGLRGLLHPQQLHLLSRQLEDPNGSF 
SNAEMSELSVAQKPEKLLERCKYWPACKNGDECAYHHP1SPCKA 
FPNCKFAEKCLFVHPNCKYDAKCTKPDCPFTHVSRRIPVLSPKP 
AVAPPAPPSSSQLCRYFPACKKMECPFYHPKHCRFNTQCTRPDC 
TFYHPTINVPPRHALKWIRPQTSE 


SS02 


246 


766 


YHTSCTVWRTAKEALENTEVPVGCLMVYNNEWGKGRNETOQTK 
NATRHAEMVAIDOVLDWCRQSGKSPSEVFEHTVTiYVTVEPCIMC 
AAALRLMKIPLWYGCQNERFGGGGSVLNIASADLPNTGRPFQC 
IPGYRAEEAVRMr.KTFYKQEMPWAPKSKVRKKECQQILWMF 


5S03 


1 


565 


jRGRTPISGGERGCAQYPIPATPARSCEiJRTMPGAGDGGKAPAR 
WLGTGLLGLFLLPVTLSLEVSVGKATDIYAVNGTEILLPCTFSS 
CFGFEDLHFRWTYNSSDAFKILIEGTVKNEKSDPKVTLKDDDRI 
TLVGSTKEKRNNISIVLRDLEFSDTGKYTCHVKNPKENNLQHHA 
TIFLQWDRRMQ 


5S04 




1506 


EDIFPAQLLKLQRHERVWQQKPPVRDHRSWGGSGAGGVAGREWT 
DQGQVALGGHYMAEGEGYFAMSEDELACSPYIFLGGDFGGGDFG 
GGDFGGGDFGGGDFGGGGS FGGHCLDYCESPTAHCNVLNWEQVQ 
RLDGILSETIPIHGRGNFPTLELQPSLIVKWRRRLAEKRIGVR 
DVRLNGSAASHVLHQDSGLGYKDLDLIFCADLRGEGEFQTVKDV 
VLDCLLDFLPEGVNKEKITPLTLKEAYVQKMVKVCNDSDRWSLI 
SLSNNSGKNVELKFVDSLRRQFEFSVDS FQI KLDSLLLFYECSE 
NPMTETFHPTIIGESVYGDFQEAFDHLCNKIIATRNPEEIRGGG 
LLKYCNLLVRGFRPASDEIKTLQRYMCSRFFIDFSDIGEQQRKL 
ESYLQNHFVGLEDRKYEYLMTLHGVVNESTVCLMGHERRQTLNL 
I TMLAIRVLADQNVI PNVANVTCYYQPAPYVADANFSNYYIAQV 
QPVFTCQQQTYSTWLPCN 


5605 


35 


1821 


SQRSCPRSPSSPAPPWARCSNPDSRTGGVPVPRAWSAGGPALGL 
MAAPVRLGRKRPLPACPNPLFVRWLTEWRDEATRSRHRTRFVFQ 
IvALRSLRRYPLPLRSGKEAKILQHFGDGLCRMLDERLQRHRTSG 
GDHAPDSPSGENSPAPQGRLAEVQDSSMPVPAQPKAGGSGSYWP 
ARHSGARVILLVLYREHLNPNGHHFLTKEELLQRCAQKSPRVAP 
GSARPWPALRSLLHRNLVLRTHQPARYSLTPEGLELAQKLAESE 
GLSLLNVGIGPKEPPGEETAVPGAASAELASEAGVQQQPLELRP 
LLCVDIGETRGGGHRPELLRELQRLHVTHTVRKLHVGDF 
VWVAQETNPRDPANPGELVLDHIVERKRLDDLCSSIIDGRFREQ 
KFELXRCGLERRVYLVEEHGSVHNLSLPESTLLQAVTNTQVIDG 
FFVKRTADIKESAAYLALLTRGLQRLYQGHTLRSRPWGTPGNPE 
SGAMT3PNPLCSLLTFSDFNAGAIKNKAQSWEVFARQLMQVRG 
VSGEKAAALVDRYSTPASLLAAYDACATPKEQETLLSTIKCGRL 
CRNLGPALSRTLSQLYCSYGPLT 


5506 


3 


1099 


C-RS RCPGPGARGGTMS PRS CLRS LRLLVFAVFSAAASNWLYLAK 
LSSVGS ISEEETCEKLKGLIORQVQMCKRKLEVMDSVRRGAQLA 
IEECQYQFRNRRWNCSTLDSLPVFGKWTQGTREAAFVYAISSA 
GVAFAVTRACSSGELEKCGCDRTVHGVSPQGFQWSGCSDNIAYG 
VAFSQSFVDVRERSKGASSSRALMNLHNNEAGRKAILTHMRVEC 
KCHGVSGSCEVKTCWRAVPPFRQVGHALKEKFDGATEVEPRRVG 
SSRALVPRNAQFKPHTDEDLVYLEPSPDFCEQDMRSGVLGTRGR 
TCNKTSKAIDGCELLCCGRGFHTAQVELAERCSCKFHWCCFVKC 
RQCQRLVELHTCR 
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SEQ 
ID 
NO: 


Predicted 
beginning 

corresponding 

to first 

sequence 


Predicted end 
corresponding 

sequence 


{A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalajiine , G=Glycine, 
H=Histidine, l=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 


5607 


521 




PPVCNPAEAMP S PGTVCS LLL.LGMLWLDLAMAGSSFLS PEHQRV 
QQRKESKK.PPAKLQPRALAGWLRPEDGGQAEGAEDELEVRPNAP 
FD VG I KLS G VQ YQQHSQALGKFLQD I LWEE AKE APAD K 


5S08 


2 


983 


WFQSPLRQADPGPPRHTLFMDFVAGAIGGVCGDAVGYPLDTVKV 
RIQTEPKYTGIWHCVRDTYHRERVWGFYRGIiLLPVCTVSLVSSE 
VFGTYRHCLAHICRLRFGNPDAKPTKADITLSGCASGLVRVFLT 
SPTEVAKVRLQTQTQAQKQQRRLSASGPLAVPPMCPVPPACPEP 
KYRGPLHCLATVAREEGLCGLYKGSSALVLRDGHSFATYFLSYA 
VLCEWLSPAGH3RPDVPGVLVAGGCAGVLAWAVATPMDVIKSRL 
QADGQGQRRYRGLLHCMVTIVREEGPRVLFKGLVLNCCRAFPVN 
MWFVAYEAVLRLARGLLT 


SS09 


1S23 


304 


AKGVWVLPSPPPRPGRGALVSGSGLRRGRSGTSWRPRRMNHKSK 
KRIREAKRSARPELKDSLDWTRHNYYESFSLSPAAVADNVERAD 
ALQLSVEEFVERYERPYKPWLLNAQEGWSAQEKWTLERLKRKY 
RNQKFKCGEDNDGYSVKMKMKYYIEYMESTRDDSPLYIFDSSYG 
EHPKRRKLLEDYKVPKFFTDDLFQYAGEKRRPPYRWFVMGPPRS 
GTGIHIDPLGTSAWNALVQGHKRWCLFPTSTPRELIKVTRDEGG 
NQQDSAITWFNVIYPRTQLPTWPPEFKPLEILQKPGETVFVPGG 
WWHWLNLDTTIAITQNFASSTNFPWWHKTVRGRPKLSRKWYR 
ILKQEHPELAV1ADSVDLQESTGIASDSSSDSSSSSSSSSSDSD 
SECESGSEGDGTVHRRKKRRTCSMVGNGDTTSQDDCVSKERSSS 
R 


5510 


54 


1196 


LER7PASADMAWTKYQLFLAGLMLVTGSINTI,SAKWADNFMAEG 
CGGSKEHSFQHPFLQAVGMFLGEFSCIAAFYLLRCRAAGQSDSS 
VDPQQPFNPLI.FLPPALCDMTGTSLMYVALNMTSASSFQMLRGA 
VII FTGLFS VAFLGRRLVLSQWLGI LATIAGLVWGLADLLS KH 
DSQHKLS EVI TGDLLI IMAQI I VAI QMVLEEKFVYKHNVHPLRA 
VGTEGLFGFVILSLLLVPMYYIPAGSFSGNPRGTLEDALDAFCQ 
VGQQPLIAVALLGNISSIAFFNFAGISVTKELSATTRMVLDSLR 
TWIWALSLALGWEAFHALQILGFLILLIGTALYNGLHRPLLGR 
LSRGRPLAEESEQERLLGGTRTPINDAS 


5611 


2 


577 


FVLPNRLGIPGSTFRGPGACASSSSLAASAKPGAGGSPALAMSG 
ELSNRFQGGKAFGLLKARQERRLAEINREFLCDQKYSDEENLPE 
KLTAFKEKYMEFDLNNEGEIDLMSLKRMMEKLGVPKTHLEMKKM 
ISEVTGGVSDTISYRDFVNMMLGKRSAVLKLVMMFEGKANESSP 
KPVGPPPERDIASLP 


5612 




721 


ASRDGYMDATIAPHRIPPEMPQYGEENHIFELMQAMWLCKHLNS 
SLLTLENLILNEFSYTATEARRLYLQRKTVPSALLVQLIQERLA 
EEDCIKQGWILDGIPETREQALRIQTLGITPRHVIVLSAPDTVL 
IERNLGKRIDPQTGEIYHTTFDWPPESEIQNRLMVPEDISELET 
AQKLLEYHRNIVRVIPSYPKILKVISADQPCVDVFYQALTYVQS 
NHRTNAPFTPRVLLLGPVGS 


5613 


115 


1279 


RGVDPALRRAEKMLPLSIKDDEYKPPKFNLFGKISGWFRSILSD 
KTSRKLFFFLCLNLSFAFVEIjLYGIWSNGLGLISDS FHMFFDST 
AILAGLAA3VISKWRDNDAFSYGYVRAEVLAGFVNGLFLIFTAF 
FI FSEGVERALAPPDVHHERLLLVSILGFWNLIGI FVFKHGGH 
G r -i 3GHGHSKSLFNGALDQAHGHVDHCHSHEVKHGAAHSHDH 
AHGH --HFH3HDGPSLKETTGPSRQILQGVFLHILADTLGSIGVI 
A3AIKMQNFGLMIADPICSILIAILIWSVIPLLRESVGILMQR 
TPPLLENSLPQCYQRVQQLQGVYSLQEQHFWTLCSDVYVGTLKL 
; U E ^DARWILSQTHNIFTQAGVRQLYVQIDFAAM 


5614 


3 


1268 


LLSRNEHACPLQAGLGLTQRKPKAIRGREGRATNQGQGETQNER 
APWGARQRLGVMAELQQLQEFEIPTGREALRGNHSALLRVADYC 
EDNYVQATDKRKALEETMAFTTQALASVAYQVGNLAGHTLRMLD 
L QGAALRQ VE ARVS TLGQMVNMHMEKVARRE I GTLAT VQRLP PG 
QKVIAPENLPPLTPYCRRPLNFGCLDDIGHGIKDLSTQLSRTGT 
LSRKS IKAPATPASATLGRP PRI PEPVHLPWPDGRLSAASSAS 
SLASAGSAEGVGGAPTPKGQAAPPAPPLPSSLDPPPPPAAVEVF 
QRPPTLEELSPPPPDEELPLPLDLPPPPPLDGDELGLPPPPPGF 
C-PDEPSWVPA3YLEKWTLYPYTSQKDNELSFSEGTVICVTRRY 



351 



WO 01/53312 



PCT/US00/34263 



Predicted 
beginning 
lcleotide 



sidue of 
amino acid 
sequence 



corresponding 



Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Fhenylalanine, G=Glycine, 
H=Histidine, I=Iscleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine r 

W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 
S DGWCEGVS SEGTGFF PGNYVE P S C 



ALGRRRPGDPREMEAAATPAAAGAARKEELDMDVMRPLINEQNF 
DGTSDEEHEQELT.PVQKHYQLDDQEGISFVQTLMHLLKGNIGTG 
LLGLPLAIKNAGIVLGPISLVFIGIISVHCMHILVRCSHFLCLR 
FKXSTLGYSDTVSFAMEVSPWSCLQKQAAWGRSVVDFFLVITQL 
GFCSVYIVFLAENVKQVHEGFLESKVF1SNSTNSSNPCERRSVD 
LRIYMLCFLPFIILLVFIRELKNLFVLSFLANVSMAVSLVIIYQ 
YWRNMPDPHNLPIVAGWKKYPLFFGTAVFAFEGIGWLPLENQ 
MKESKRFPQALNIGMGIVTTLYVTLATLGYMCFHDEIKGSITLN 
LPQDVWLYQSVKILYSFGIFVTYSIQFYVPAEIIIPGITSKFHT 
KWKQICEFGIRSFLVSITGAGAILIPRLDIVISFVGAVSSSTLA 
LILPPLVEILTFS KEHYNI WM VLKN I S I AFTG WG FLLGTYITV 
EEIIYPTPKVVAGTPQSPFLNLNSTCLT3GLK 



DDFVRCGPQSAAMGASARLLRAVIMGAPGSGKGTVSSRITTHFE 
LKHLSSGDLLRDNMLRGTEIGVLAKAFIDQGKLIPDDVMTRLAL 
HELKNLTQYSWLLDGFPRTLFQAEALDRAYQIDTVINLHVPFEV 
IKQRLTARWIHPASGRVYNIEFNPPKTVGIDDLTGEPLIQREDD 
KPETVIKRLKAYEDQTKPVLEYYQKKGVLETFSGTETNKIWPYV 
YAFLQTKVPQRSQKASVTP 

PWRGRGSRPRGAGAMAEEQVNRSAGLAPD CEASATAE XT VS S VG 
TCEAAGKSPEPKDYDSTCVFCRIAGRQDPGTELLHCENEDLICF 
KDIKPAATHHYljWPKKHIGNCRTLRSDQVELVENMVTVGKTll, 
ERNNFTDFTWVRMGFHMPPFCSISHIiHLHVLAPVDQLGFLSKLV 
YRWS Y WF I TADHL IEKLRT 

YLNYIKLKSEIIKr,SGKEDLWEKLQYLWKSTLNLP3DLLRVPDES 

LFLKSGGDSLKS1RLLSEIEKLVGTSVPGLLEIILSSS1LEIYN 

HILQTWPDEDVTFRKSCATKRKLSNINQEEASGTSLHQKAIMT 

FTCHNEINAFWLSRGSQILSLNSTRFLTKLGHCSSACPSDSVS 

QTNIQNLKGLMSPVLIGKSKDPSCVAKVSEEGKPAIGTQKMELH 

VRWRSDTGKCVDASPLWIPTFDKSSTTVY'IGSHSHRMKAVDFY 

SGKVKWEQILGDRIESSACVSKCGNFIWGCYNGLVYVLKSNSG 

EKYWMFTTEDAVKSSATMDPTTGLIYIGSHDQHAYALDIYRKKC 

VWKSKCGGTVFSSPCLNLIPHHLYFATLGGLLLAVNPATGNVIW 

KHS CGKPLFS S PQCCSQYI CIGCVDGNLLCFTHFGEQVWQFS TS 

GPIFSSPCTSPSEQKIFFGSHDCFIYCCNMKGHLQWKFETTSRV 

YATPFAFHNYNGSNEMLLAAASTDGKVWILBSQSGQLQSVYELP 

GEVFSSPWLESMLIIGCRDNYVYCLDLLGGNQK 

DSFVLPTSGNVISTAQPAQPWSAVEAALRSljGSPPGAGRGCPCP 

AQSLHSHQLAAWDPT.KPSTiRSYPPHLLQHPQIRSLTASSGHLGR 

RSCPQPRPLEELLRAGSSTRPQPLTSSCCGMSCMYSFLGHCSVL 

LWGTKGRGSGSPSSPGCCLHPPAQHSQDLPLVHVDVGWQPPLGP 

TVGLRPGLLGERQRGALRAGDPQCQCPLPATVREDLGVPSPWAA 

ECSPPATP 

PLPPPTLAMFLTRSEYDRGVNTFSPEGRLFQVEYAIEAIKLGST ■ 

AIGIQTSEGVCLAVEKRITSPLMEPSSIEKIVEIDAHIGCAMSG 

LIADAKTLIDKARVETQNHWFTYNETMTVESVTQAVSNLALQFG 

EEDADPGAMSRPFGVALLFGGVDEKGPQLFHMDPSGTFVQCDAR 

AIGSASEGAQSSLQEVYHKSMTLKEAIKSSLIILKQVMEEKLNA 

TNIELATVQPGQNFHMFTKEELEEVIKDI 

WEFVEYTATDANVKNESLSSVQQLGIKMTVRYGKFLSLLKDGA 
ENDLTKVLKHCERFLKQQQTSIKSSLLCLQGNYAGHDWFVSSLF 
MIMLGDKEKTFQFLHQFSRLLTSAFLWLPRLHISSYLPHDTVES 
GIHPVYFCSTHYIEMLLKAELPLVFSAFHMSGFAPSQICLQWIT 
QCFWNYLDWIEICHYIATCVFLGPDYQVYICIAVFKHLQQDILQ 
HTQTQDLQVFLKEEALHGFRVSDYFEYMEILEQNYRTVLLRDMR 
NIRLQST 

AAS TKEAVSRKRSHSAS EKSGTGTS ISKRLNMNPQ IRNPMKAMY 
PGTFYFQFKNLWEANDRNETWLCFTVEGIKRRSWSWKTGVFRN 
QVDSETHCHAERCFLSWFCDDILSPNTKYQVTWYTSWSPCPDCA 
GEVAEFLARHSNVNLTIFTARLYYFQYPCYQEGLRSLSQEGVAV 
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SEQ 
ID 
NO: 


beginning 
nucleotide 
location 
corresponding 
to first 
amino acid 

amino acid 


Predicted end 

corre spending 
to first 
amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








EIMDYEDFKYCWENFVYNDNEPFKPWKGLKTITFRLLKRRLRESL 
Q 


5623 




954 


FLPFFIRAPKISRNGQWLFTFTTPFPFANKALPGWEGIVPACFW 
RKKILTPSTGTMELLQVTILFLLPSICSSNSTGVLEAANNSLW 
TTTKPSITTPIITESLQKNWTPTTGTTPKGTITNSLLKMSLMST 
ATFLTSKDEGLKATTTDVRKNDSI1SKVTVTSVTLPNAVSTLQS 
SKPKTETQSSIKTTEIPGSVLOPDASPSKTGTLTSIPVTIPENT 
SQSQVIGTEGGKNASTSATSRSYSSIILPWIALIVITLSVFVL 
VGLYRMCWKADPGTPENGNDQPQSDKESVKLLTVXTISHESGEH 
SAQGKTKN 


5S24 


159 


898 


PGVAAAAGALPQYHGPAPALVSCRRELSLSAGSLQLERKRRDFT 
SSGSRKLYFDTHALVCLLEDNGFATQQAEIIVSALVKILEANMD 
IVYKDMVTKMQQEITFQQVMSQIANVKKDM1ILEKSEFSALRAE 
NEKIKLELHQLKQQVMDEVIKVRTDTICLDFNLEKSRVKELYSLN 
EKKLLELRTEIVALHAQQDRALTQTDRKIETEVAGLKTMLESHK 
LDNIKYIAGSIFTCLTVALGFYR1WI 


5625 




11BD 


TIPS S AAAQRAGP PAGALEALS PGGARAHAEKRGEMRATPIAAP 
AGSLSRKKRLELDDNLDTERPVQKRARSGPQPRLPPCLLPLSPP 
TAPDRATAVATASRLGPYVLL2PEEGGRAYQALHCPTGTEYTCR 
VYPVQEALAVLEPYARLPPHKHVARPTEVLAC-TQLLYAFFTRTH 
3DMHSLVRSRHRIPEPEAAVL?RQMATALAHCHQHGLVLRDLKL 
CRFVFADRERKKLVLENLEDSCVLTGPDDSLWDKHACPAYVGPE 
ILSSRASYSGKAADVWSLGVALFTMLAGHYPFQDSEPVLLFGKI 
RRGAYALPAGLSAPARCLVRCLLRREPAERLTATGILLHPWLRQ 
DPKPLAPTRSKLWEAAQWPDGLGLDRAREEEGDREWLYG 


562S 


3123 


2011 


ITPIIVYWAQRIIRELYLRVELSDVQNPAISI 
TENVLHFK.AQGHGAKGDNVYEFHLEFLDLVKPEPYYKLTCRQVN 
ITVQKKVSQWWERLTKQEKRPLFLAPDFDRWLDESDAEMELRAK 
EEERLNKLRLESEGSPETLTNLRKGYLFMYNLVQFLGFSWIFVN 
LTVRFCILGKESFYDTFHTVADMMYFCQMLAWETINAAIGVTT 
SPVLPS L I QLLGR1JFI LF 1 1 FGTMEEMQNKAWFFVFYLWSAIE 
IFRYSFYMLTCIDMDWKVLTWLRYTLKIPLYPLGCLAEAVSVIQ 
SIPIFNETGRFSFTLPYPVKIKVRFSFFLQIYLIMIFLGLYINF 
RHLYKQRRRRYGQKKKKIH 


5S27 


3123 


2011 


PPRALGSVAMENQVLTPHVYWAQRHRELYLRVELSDVQNPAISI 
TSWLHFKAQGHGAKGDNVYEFHLEFLDLVKPEPVYKLTQRQVN 
"TV2KKVSQWWERLTKQEKRPLFLAPDFDRWLDESDAEMELRAK 
EEERL^KLRLESEGSPETLTNLRKGYLFMYNLVQFLGFSWIFVN 
LTVRFCTLGKESFYDTFHOTADMMYFCQMLAWETINAAIGVTT 
SPVLPSLIQLLGRNFILFIIFGTMEEMQNKAWFFVFYLWSAIE 
IFRYSFYMLTCIDMDWKVLTWLRYTLWIPLYPLGCLAEAVSVIQ 
S1PIFNETGRFSFTLPYPVK1KVRFSFFLQIYLIMIFLGLYINF 
RHIiYKCRRRRYGQKKKKIH 


5628 ' 


75 


1455 


VAGAMASKCLKAGFSSGSLKSPGGASGGSTRVSAMYSSSPCKLP 
SLSPVARSFSACSVGLGRSSYRATSCLPALCLPAGGFATSYSGG 
GGW FGEG I LTGNEKE TMQS LNDRLAGYLE KVRQLEQENAS LESR 
IREWCEQQVPY.MCPDYQSYFRTIEELQKKTLCSKAENARLWEI 
DNAXLAADDFRTKYETEVSLRQLVESDINGLRRILDDLTLCKSD 
LEAQVESLKEEEiLCLKKNHEEEVNSLRCQLGDRLNVEVDAAPPV 
DLNRVLEEMRCQYETLVENNRRDAEDWLDTQSEELNQQWSSSE 
QLQS CQAE 1 1 ELRRTVNALE IELQAQHSMRDALESTLAETEARY 
S SQLAQMQCM I TNVEAQLAE IRADLERQNQE YQVLLDVRARLEC 
EINTYRGLLESEDSKLPCNPCAPDYSPSKSCLPCLPAASCGPSA 
ARTNCSARPI CVPCPGGRF 


5629 


2287 


938 


GRPRSSSDNRNFLRERAGLSSAAVQTRIGNSAASRRSPAARPPV 
PAPPALPRGRPGTEGSTSLSAPAVLWAVAVWVWSAVAWAMA 
NYIHVPPGSPEVPKLOTTVQDQEEHRCREGALSLLQHLRPHWDP 
QEVTLQLFTDGITNKLIGCYVGWTMEDWLVRIYGNKTELLVDR 
DEEVKSFRVLQAHGCAPQLYCTFNNGLCYKFIQGEALDPKHVCN 
FAIFRL IARQLAKIHAIHAHNGWI PKSNLWLKMGKYFSL IPTGF 



353 



PCT/US00/34263 



Predicted end 
nucleotide 

cor re sponding 
to first 

residue of 



Amino acid segment containing signal peptide 
{A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, l=lsoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible 



ADEDINKRFLSDIPSSQILQEEMTWMKEILSNLGSPWLCHNDL 
LCKNIIYNEKQGDVQFIDYEYSGYNYLAYDIGNHFNEFAGVSDV 
DYSLYPDRELQSQWLRAYLEAYKEFKGFGTEVTEKEVEILFIQV 
NQFAIASHFFWGLWALIQAKYSTIEFDFLGYAIVRFNQYFKMKP 
EVTALKVPE 



GFWAIAQTCAHHLPPGSPWLVPASPWRLPEMSSFGYRTLTVALF 
TLICCPGSDEKVFEVHVRPKKLAVEPKGSLEVNCSTTCNQPEVG 
GLETSLDKILLDEQAQWKHYLVSNISHDTVLQCHFTCSGKQESM 
NSNVSVYQPPRQVILTLQPTLVAVGKSFTIECRVPTVEPLDSLT 
LFLFRGNETLHYETFGKAAPAPQEATATFNSTADREDGHRNFSC 
LAVLDLMSRGGNI FHKHSAPKMLE I YEPVSDSQMVI IVTWSVL 
LSLFVTSVLLCFIFGQHLRQQRMGTYGVRAAWRRLPQAFRP 



SRVDDFVRPEPSRAEPSRfat^ iUx t-'h 1 \ • SVFGKLFGAGGGK 
AC-KGGPTPQEAIQRLRDTEEMLSKKQEFLEKKIEQELTAAKKHG 
TKNKRAALQALKRKKRYEKQLAQIDGTLSTIEFQREALENANTN 
TEVLKNMGYAAKAMKAAHDNMDIDKVDELMQDIADQQELAEEIS 
TAISKPVGFGEEFDEDELMAELEELEQEELDKNLLEISGPETVP 
IALPSKPAKKKEEEDDDMKELENWAGSM 



WLGWSPPRRLWWGSLGAAORPAVPVSGIARSLHVkTRWHRRA" 
STOVARGRLGVWAQPQPLLPRPVGSRREMQPPGPPPAYAPTNGD 
FTFVSSADAEnT,SGSTASPDVKLNLGGDFIKESTATTFLRQRGY 
GWLL3VEDDDPEDNKPLLEBLDIDLKDIYYKIRCVLMPMESLGP 
33RQWRDNPDFWGPLAWLFFSMISLYGQFRWSWIITIWIFGS 
LTIFLLARVLGGEVAYGQVLGVIGYSLLPLIVIAPVLLWGSPE 
VVSTLIKLFGVFWAAYGAAGLLVGEEFKTKKPLLIYPIFLT.YIY 
FLSLYTGV 



QGCSKTMSVGRPFYRSSEFMEQLLSSHLHQVPl-iC •"! 1 TT 
CLFENSVSKLYMLCFNFFMSIFFYSLSITKLNLIYLWGLSYQSL 
LLLLLSGHRPWGSSMV 



PRATGRIRSRAAASRPRAGAGASGAEPRSGRERSRLSGRRAPAM 
ARNTLSSRFRRVDIDEFDENKFVDEQEEAAAAAAEPGPDPSEVD 
GLLRQGDMLRAFHAALRNSPVNTKNQAVKERAQGWIjKVLTNFK 
SSEIEQAVQSLDRNGVDLLMKYIYKGFEKPTENSSAVLLQWHEK 
ALAVGGLGS IIRVLTARKTV 



DRGPRSTATDTGRARVSFWRFPLDPGVKNSNVQISGEKRRFRTL 
RSLFHPFPVTRSGAPRAVLVGSSWPAKMVAPAVKVARGWSGLAL 
GVRRAVLQLPGLTQVRWSRYSPEFKDPLIDKEYYRKPVEELTEE 
EKYVRELKKTQLIKAAPAGKTSSVFEDPVISKFTNMMMIGGNKV 
IARSLMIQTLEAVKRKQFEKYHAASAEEQATIERNPYTIFHQAL 
KNCEPMIGLVPILKGGRFYQVPVPLPDRRRRFLAMKWMITECRD 
KKHQRTLMPEKLSHKLLEAFHNQGPVIKRKHDLHKMAEANRALA 
HYRWW 



LEDTI CQHP PAEKKLYLYHRKLREVERNG I PRLPKDVFMDTHQG 
LTDVRAKVTGFSEGWDSVKGGFSSFSQATHSAAGAWSKPREI 
ASLIRNKFGSADNIPNLKDSLEEGQVDDAGKALGVISNFQSSPK 
YGSEEDCS SATSGSVGANSTTGGI AVGASS S KTNTLDMQS SGFD 
ALLHEIQEIRETQARLEESFETLKEHYCRDYSLIMQTLQEERYR 
CERLEEQLNDLTELHQNEILNLKQELASMEEKIAYQSYERARDI 
QEALEACQTRISKMELQQQQQQWQLEGLENATARNLLGKLINI 
LLAVMAVLLVFVSTVANCWPLMKTRNRTFSTLFLWFIAFLWK 
HWDALFSYVERFFSSPR 



MSFCGARANAKMMAAYNGGTSAAAAGHHHHHHHHLPHLPPPHLH 
HHHHPQHHLHPGSAAAVHPVQQHTSSAAAAAAAAAAAAAMLNPG 
QQQPYFPSPAPGQAPGPAAAAPAQVQAAAAATVKAHHHQHSHHP 
QQQLDIEPDRPIGYGAFGWWSVTDPRDGKRVALKKMPNVFQNL 
VSCKRVFRELKMLCFFKHDMVLSALDILQPPHIDYFEE1YWTE 
LMQSDLHKI1VSPQPLSSDHVKVFLYQILHGLKYLHSAGILHRD 
I KPGNLLWSNCVLKI CDFGLARVEELDESRHMTQE WTQYYRA 
PEILMGSRHYSNAIDIWSVGCIFAELLGRRILFQAQSPIQQLDL 
ITDLLGTPSLEAMRTACEGAKAHILRGPHKQPSLPVLYTLSSQA 



354 



WO 01/53312 



PCT/US00/34263 



ID 


beginning 
nucleotide 

corresponding 

amino acid 
sequence 


Predicted end 

residue of 
amino acid 
secnience 


(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








THEAVHLLCRMLVFDPYKRISAKDALAHPYLDEGRLRYHTCMCK 
CCFSTSTGRVYTSDFEPVTNPKFDDTFEKNLSSVRQVKEIIHQF 
ILEQQKGKRVPLCINPQSAAFKSFISSTVAQPSEMPPSPLVWE 


5638 


125 


1155 


DRKMSELDQLRQEAEQLKNQIRDARKACADATLSQITNHIDPVG 
R1QMRTRRTLRGHLAKIYAMHWGTDSRLLVSASQDGKLIIWDSY 
TTNKVHAIPLRSSWVMTCAYAPSGNYVACGGLDNICSIYNLKTR 

GQQTTTFTGHTGDVMSLSLAPDTRLFVSGACDASAKLWDVREGM 
CRQTFTGHESDINAICFFPNGNAFATGSDDATCRLFDLRADQEL 
MTYSHDNI ICGITSVSFS KSGRLLLAGYDDFNCNVWDALKADRA 
GVLAGHDNRVSCLGVTDDGMAVATGSWDSFLKIWN 


5S39 


125. 


_155 


DRKMSELDQLRQEAEQLKNQIRDARKACADATLSQITNNIDPVG 
R I QMRTRRTLRGHLAKI YAMHWGTDSRLLVSASQDGKL IIWDSY 
TTNKVHAI PLRS SWVMTCAYAPSGNYVACGGLDNI CS I YNLKTR 
EGNVRVSRELAGHTGYLSCCRFLDDIJQIVTSSGDTTCALWDIET 
GQQTTTFTGHTGDVMSLSLAPDTRLFV3GACDASAKLWDVREGM 

MTYSHDNIICGITSVSFSKSGRLLLAGYDDFNCNVWDAT.KADHA 
GVLAGHDNRVSCLRVTl>nRMAVATfi.SWnSFI,KIWN 


5G40 


280 


.10 92 


QCGNKKTMLSHHTMMKQRKQQATAIMKSTOGNDVDGMDLGKKVS 
IFRDIMLEELSHLSNRGARLFKMKQRRSDKYTFENFQYQSRAQI 
NHSIAMQNGKVDGSNLEGGSQQAPLTPPNTPDPRSPPNPDNIAP 
GYSGPLKEIPPEKFNTTAVPKYYQSPWEQAISNDPELLEALYPK 
LFKPEGKAELPDYRSFNRVATPFGGFEKASRMVKFKVPDFELLL 

VPESEDL 


5641 




332 


2RHNCKGDVKLLSNQMDKLFAFHLFTFHGLLHFLDGSIQKLIQA . 

EIILSDNSSILVLENNFLFKVKSKQFIHLIAKKFYISITIVSAS 

NGESFVLSMIVTG 


5642 


199 


1247 


ITPCRMDFLVLFLFYLASVLMGLVLICVCSKTHSLKGLARGGAQ 
I FS CI I PECLQRAMHGLLHYLFHTRNHTFIVLHLVLQGMVYTEY 

KANELLFLHVYEFDEVMFPKNVRCSTCDLRKPARSKHCSVGNWC 
VHRFDHHCVWVNNCIGAWNIRYFLIYVLTLTASAATVAIVSTTF 
LVHLWMSDLYQETYIDDLGHLHVMDTVFLIQYLFLTFPRIVFM 
LGFWVLSFLLGGYLLFVLYLAATNQTTNEtTYRGDWAWCQRCPL 
VAWPPSAEPQVHRNIHSHGLRSNLQEIFLPAFPCHERKKQE 


5643 


1 


847 


PSGC-VRDVETRGPGSRAARGPRWMKRRGVGAGAIAKKKLAEAK 
YKERGTVLAEDQIAQMSKQLDMFKTNLEEFASKHKQEIRKNPEF 

jALKHRNGGLITLEELHQQVLKGRGKFAQDVSQDDLIRAIKKLK 
ALGTGFG 1 1 PVGGTYLIQSVPAELNMDHTWLQLAEKNGYVTVS 
EIK? "LKWETERARQVLEHLLKEGLAWLDLQAPGEAHYWLPALF 
TDLYSQE ITAEEAREALP 


5644 


83 


1138 


PRRMGSWQLITSVGVQQNHPGWTVAGQFQEKKRFTEEVIEYFQ 

YVAIEDKDMQQKEQQFREWFLKEFPQIRWKIQESIERLRVIANE 
IEKVHRGCVIARVVSGSTGILSVIGVMIAPFTAGLSLSITAAGV 
GLGIASATAGIASSIVENTYTRSAELTASRLTATSTDQLEALRD 
ILHDITPNVLSFALDFDEATKMIANDVHTLRRSKATVGRPLIAW 
RYVPINWETLRTRGAPTRIVRKVARNLGKATSGVLWLDWNL 
VQDSLDLHKGEKSESAELLROWAQELEENLNELTHIHQSLKAG 


5645 


537 


799 


VQSVRDLKRLSPTDPPGDSGNRDVTREDPVTGPLNSASSQVPTL 
YLCLQKSLLGHSSVEDARATMELYQISQRIRARRGLPRLAVSD 


5646 


3745 


3328 


AEQYGTSPHLLPTMLLSSCLPPANVTTKAATPPPLVLSLTTADP 
AGKPAPCRVTLTLLRAS I PATKRAS FLS S FI KMFFEELE YILGF 
LSLLKFHVHVSVYSAICHFQKEGTGNSRSFTCTPELFPRLQTHL 
RASGGAQ 


5647 


288 


800 


GVIMATSELSCSySEENCERREAFWAEWKDLTLSTRPEEGCSLH 
EEDTQRHETYHQQGQCQVLVQRSPWLMMRMGILGRGLQEYQLPY 
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SEQ 
ID 
NO: 


Predicted 
beginning 

corresponding 

residue of 


Predicted end 
nucleotide 

corresponding 
to first 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosins, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








QEVLPLPIFTPAKMGATKEEREDTPIQLQELLALETALGGQCVD 
RQEVAEITKQLPPWPVSKPGALRRSLSRSMSQEAQRG 


564 8 


7 


1518 


VLSELCGRHEALREVGAEWPPPTCSPKICSGLQQAGNTDWSLTM 
APQSLPSSR^4APLGMLLGLLMAACFTFCLSHQNLKEFALTWPEK 
SSTXETERKETKAEEELDAEVLEVFHPTHEWQALQPGQAVPAGS 
HVRLNLQTGEREAKLQYEDKFRNNLKGKRLDINTNTYTSQDLKS 
AIAKFKEGAEMESSKEDKARQAEVKRLFRPIEELKKDFDELNW 
IETDMQIMVRLINKFNSSSSSLEEKIAALFDLEYYVHQMDNAQD 
LLSFGGLQWINGLNSTEPLVKEYA^FVLGAAFSSNPKVQVEAI 
EGGALQKLLVILATEQPLTAKKKVLFALCSLLRHFPYAQRQFLK 
LGGLQVLRTLVQEKGTEVLAVRWTLLYDLVTEKMFAEEEAELT 
QEMSPEKLQQYRQVHLLPGLWEQGWCEITVHLLALPEHDAREKV 
LQTLGVLLTTCRDRYRQDPQLGRCLASLQAEYQVLASLELQDGE 
DEGYFQELLGSVNSLLKELR 


5649 


1172 


3006 


MLQEQLDAINEEIRMIQEEKESTELRAEEIETRVTSGSMEALNL 

MGVMTLPSDLRKHRRKLLSPVSREENREDKATIKCETSPPSSPR 
TLRLEKLGHPALSQEEGKSALEDQGSNPSSSNSSQDRLHKGAKH 
KGIKSSIGRLFGKKEKGRLIQLSRDGATGHVLLTDSEFSMQEPM 
VPAKLGTQAEKDRRLKKKHQr.I.EDARRKGMPFAQWDGPTWSWL 
ELWVGMPAWYVAACRANVKSGAIMSAL3DTEIQRE1GISNALI-IR 
LKT.RI.AIQEMVSLTSPSAPPTSRTSSGNVWVTIIEEMETLETSTK 
TDSEEGSWAQTLAYGDMNHEWIGNEWLPSLGLPQYRSYFMECLV 
DARMLDHLTKKDLRVHLKMVDSFHRTSLQYGIMCLKRLNYDRKE 
I I Li U IT I /LVWTNDQWHWVQSIGLRDYAGWLHESGV 
HGALLALDENFDHNTLALILQIPTQNTQARCVMEREFNNLLALG 
TDRKLDDGDDKVFRRAPSWRKRFRPREHH3RGGMLSASAETLPA 
GFRVSTLGTLQPPPAPPKKIMPEAHSHYLYGHMLSAFRD 


5G50 


1172 


3006 


MLQE >L A1NEEIRMIQEEKESTELRAEEIETRVTSGSMEALNL 
KQLRKRGSIPTSLTDLSLASASPPLSGRSTPKLTSRSAAQDLDR 
MGVMTLPSDLRKHRRKLLSPVSREENREDKATIKCETSPPSSPR 
TLRLEKLGHPALSQEEGKSALEDQGSNPS3SNSSQDSLHKGAKR 
KGIKSSIGRLFGKKEKGRLIQLSRDGATGHVLLTDSEFSMQEPM 
VPAKLGTQAEKDRRLKKKHQLLEDARRKGMPFAQWDGPTWSWL 
ELWGMPAWYVAACRAOTKSGAIMSALSDTEIQREIGISNALHR 
LKLRLAIQEMVSLTSPSAPPTSRTSSGNVWVTHEEMETLETSTK 
TDSEEGSWAQTLAYGDMNHEWIGNEWLPSLGLPQYRSYFMECLV 
DARMLDHLTKKDLRVHLF0WDSFHRTSLQYG1MCLKRLNYDRKE 
LEKRREESQHEIKDVLVWTNDQWHWVQSIGLRDYAGNLHESGV 
HGALLALDENFDHNTLALILQIPTQNTQARQVMEREFNWLLALG 
TDRKLDDGDDKVFRRAPSWRKRFRPREHHGRGGMLSASAETLPA 
GFRVSTLGTLQPPPAPPKKIMPEAHSHYLYGHMLSAFRD 


5S51 


646 


1869 


ARQGQRQPWG*EARAKGPASESPRV*EGSGWEGPASP*TPGSTL 
AWGEGAGIR*ASGLTAAGAASAAAA/PPPTRGGPAPAGCGRAPP 
WPAPLRVPTHGRAPAPRSRAAPRAPALSHGTAAAALSPASPAGP 
ADP*LPGHSSQSPPRG*RWGRSRSAPAPAHPEHPAPAGSASASQ 
QTPGWPGSCCLAQGWQAEPLGAPGAEDG\PVPPQRGFPLGTLGS 
• • • 3LAGYG*AGAPGTQATAPRAAGQTPVAAAPNCRV*GSA 
PALHRAPAAADPGSPLQAPPRAWASPAAAGPGLSSSDYCGGLGA 
bWKAGISPELLGAAGLSDNWARCPGPGPAE*GGQPGCRTrPASA 
CMPSP?VEC-SLGLSRKGHGDLPSQAR*GWEECRRARHLVPLPRL 
LGPRGRTGRPSSPS 


5652 


735 


343 


HHK . -'2-IIHQKS FSCPEPACGKSFNFKKHLKEHMKLHSDTRDYI 
CE FCARS FRTSSNLVIHRRIHTGEKPLQCE ICGFTCRQKASLNW 
HQEKHAETVAALRFPCEFCGKRFEKPDSVAAHRSKSHPALLLA 


5653 


66 


1401 


RGRLQSRGRLTLGLVLLLLDILGARQHGQRVSHGWKGGFLTAPL 
CFEQPCQPGTRRGRRRSLKEATEPQLAMAEEFVTLKDVGMDFTL 
GDKEQLGLEQGDTFWDTALDNCQDLFLLDPPRPNLTSHPDGSED 
LEELAGGSPEATSPDVTETKNSPLMEDFFEEGFSQEI/SRDVIQ 
G'A'LLELQ?RRSLYRGHLVR*FARRSRKSSEV*YCHQRGKSHGMQ | 



356 



WO 01/53312 



PCT/US00/34263 



ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 

amino acid 


Predicted end 
nucleotide 

corresponding 


Aiti-no acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, -R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Try?tophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion] 








ES*IK3RTQSCVHRFHGRRFHG\DOTSEKTLTPAKSKEYRGEFF 
SYSDHSQQDSVQE3EKPYQCSECGKSFSGSYRLTQHWITHTREK 
PT\fHQSCEQGFDRKASHSGYPKTHTGYKFYVCWEYGTPFSQSTY 
LWHQKTHAGEKPCKSQDSDHPPSHDTQSGEHQKTHTDSKSYNCN 
ECGKAFTRIFHLTRHQKIHTRKRYEC3KCQATFNLRKHLIQHQK 
THAANV 


5S54 




598 


TLPLFPGRRFRGWRRCGAVAARKNSTGGNVSIKTQRRDSVRMSAL 
NWKPFVYGGIASITAECGTFPIDLTKTRFQIQGQTNDAKFKEII 
YRGMLHALVRrGREEGLKALYSG*VGLHAFLCHCSLFHMGIDFR 
PRLHRSQVKSLRCV*KEQIA**/HFSLLISTLISKYIYYAADVL 
EKLFYYIQVQTDNNKKICLFKNI 


5S5S 




867 


RPPGIRAPRQLHPAAGRRPDASARPRFRPTVLLHDPFQLSFPPP 
PLSYPSVFPAVARVLPQRSGDYRAAGMPCLSGGGGGGGGDPELC 
ATDEMIPFKDEGDPQ\REKIFAEIVNPEEEGDLADIKSSLVNES 
EIIPASNGHEVARQAQTSQEPYHDKAREHPDDGKHPDGGLYNKG 
PSYSSYSGYIMMPNMNNDPYMSNGSLSPPIPRTSNKVPVVQPSH 
AVKPLTPLITYSDEHFSPGSHPSHIPSDVNSKQGMSRHPPAPDI 
PTFYPLSPGGGGQITPPLGWQGQP 


5656 


228 


1066 


PRRVPPLPEFASGPGAAFFHSGRLQRSJVTKDSAGCFSQCRSRAM 
LVLRSGLTKALASRTIAPQVCSSFATGPRQYDGTFYEFRTYYLK 
PSNMNAFMENLKKN1HLRTSYSELVGFWSVEFGGRTNKVFHIWK 
YDNFPHRAEVRKA1ANCKEWQKQSIIPNLARIDKQETEITYLIP 
WSKLQKPPKEGVYELAVFQMKPGGPALWGDAFERAINAHVNLGY 
TKWGVFHTEYGELNEVHVLWrtNESADSRAAVRHKSHEDP I SWG 
GVRESVNYL\VSQQNM 


5S57 


105 


1052 


GQRLQSPRVQMPVQPPSKDTEEMFAEGDSAAEMNGEEEESEEER 
SGSQTESEEESSEI^DDEDYERRRSECVSEMLDLEKQFSELKEXL 
FRERLSQLRLRLEEVGAERAPEYTEPLGGLQRSLKIRIQVAGIY 
KGFCLDVIRNKYECELQGAKQHLESEKLLLYDTLQGELQERIQR 
LEEDRQSLDLSSEWWDDKLHARGSSRSWDSLPPSKRKKAPLVSG 
PYIVYMLQEIDILEDWTAIKKARAAVSPQKRKSD\DLDPAVHSQ 
GDPQSSWHCTQDSRLPPADRRTHRPLRVCPARLLWCCWALPLHL 
ALVWTPPL 


S658 


234S 


3541 


TERRVYNPWPEPDPD\CIQEDPWNLPNS I KTLVDNI QRYVEDGK 
NQLLLALLKCTDTELQLRRDAIFCQALVAAVCTFSEQLLAALGY 
RYNNNGEYEESSRDASRKWLEQVAATGVLLHCQSLLSPATVKEE 
RTMLEDIWVTLSELDNVTFSFKQLDEWYVANTNVFYHIEGSRQA 
LKVIFYLDSYHFSKLPSRLEGGASLRLHTALFTKVLENVEGLPS 
PGSQAAEDLQQDIWAQSLEKVQQYYRKLRAFYLERSNLPTDAST 
TAVKIDQLIRFINALDELCRLMKSFVHPKPGAAGSVGAGLIPZS 
SELCYRLGACQMVMCGTGMQRSTLSVSLEQAAILARSHGLLPKC 
IMQATDIMRKQGPRVEILAKNLRVKDQMPQGAPRLYRLCQPKMN 
GDL 


5659 


2 


696 


iWjrSGEVSPKGELGAWRGNSGRPKIIGRAAEAENEDRTLGRLLP 
GNERSQPRSPLRLIAPQLKAEAAADKGLA?VPPPFSSGHSGPC\ 
ERE3EGQRGRGRSRRGAHLELKPSPGLRAGAPTDRGRGGPAEVA 
AAGGRRMVQKESQATLEERESELSSNPAASAGASLEPPAAPAPG 
EDNPAC-AGG\AAVAGAAGGARRFLCGWEGFYGRPWVMEQRKEL 
FRRLQKWELNTYL 


5660 


229 


853 


PVTMKAFSELPMPLLINLIVSLLGFVATVTLIPAFRGHFIAARL 
CGQDLNKTSRQQIPESQGVISGAVFLIILFCFIPFPFLNCFVKE 
QRKAFPHHEFVALIGALLAICCMIFLGFADDVLNLRWRHKLLLP 
TAASLPLLMVYFTNFGNTTIWPKPFRPILGLHLDLGR*SYHCC 
PYGTYFREPFLVLHILLQVFLFCLCVFPDPFW 


5661 




473 


LNLY?SPCGGIPKLPGI,PREAAAALGASFLAEAPLPVTVRGSGL 
AGMAVTCDPKAFLS I CFVTLVFLQLPLAS ICQN*GTDSCASRGK 
ADFDVTGPHAPILAMAGGHVELQCQLFPWISAEDKELRWYRCQP 
SLA'i/HMHERGMDMDGEQKWQYRGRT 


56E2 


2 


1318 


LRK3GRCRRGSNRGVWAAPAEGLGGRGMLGVRCLLRSVRFCSSA 
PFPKHKPSAKLSVRDALGAQNASGERIKIQGWIRSVRSQKEVLF 
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ID 


Predicted 
beginning 
nucleotide 

corresponding 

amino acid 
residue of 
amino acid 


Predicted end 
nucleotide 

corresponding 
to first 
amino acid 
residue of 


Am^no acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Iysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptcphan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=pos3ible nucleotide deletion, 
\=possible nucleotide insertion) 








LHVNDGSSLESLQWADSGLDSRELTFGSSVEVQGQLIKSPSKR 
QNVELXAEKIKVIGNCDAKDFPIKYKERHPLEYLRQYPHFRCRT 
NVLGSILRIRSEATAAIHSFFKDSGFVHIHTPIITSNDSEGAGE 
LFQLEPSGKLKVPEENFFNVPAFLTV8GQLHLEVMSGAFTQVFT 
FGPTFRAENSQSRRHLAEFYMIEAEISFVDSLQDLMQVIEELFK 
ATTM^VLSKCPEDVELCHKFIAPGQKDRL*HMLKNNFLIISYTE 
AVEILKQASQNFTFTPEWGADLRTEHEKYLVKHCGNIPVFVINY 
PLTLKPFYMRDNEDGFQELEGSVA*HSLGLMILLSIWIGQP 


5653 


119 


698 


PADIGRSTAKTPGPPRSLEMDDPRYGMCPLKGASGCPGAERSLL 
VQSYFEKGPLTFRDVAIEFSLEEWQCLDSAQQGLYRKVMLENYR 
NLVFLGIALTKPDLITCLEQGKEPWNIRRHEMVAKPPVICSHFP 
QDLWAEQDIKDSFQEAILKKY3KYGHANFQLQKGCKSVDECKVH 
KEHDNKLNQCLI PKKKK 


5664 


118 


572 


SLSMESNHKSGDGLSGTQKEAALRALVQRTGYSLVQENGQRKYG 
GPPPGWDAAPPERGCEIFIGKLPRDLFEEELIPLCEKIGKIYEM 
RMMMDFNGNNRGYAFVTFSNKVEAKNAIKQLNNYEIRNGRLLGV 
CAS VDNCRLFVGG I PKTKK 


5665 


347 


702 


WQHLIILLHCERTSPAMITSELPVLQDSTNETTAHSDAGSELE 
ETEVKGKRKRGRPGRPPSTNKKPRKSPGEKSRIEAGIRGAGRGR 
ANGHPQQNGEGEPVTLFEWKLGKSAMQRC 


5666 


213 


540 


VSCLPTSCKMITLNNQDQPVPFNSSHPDEYKIAALVFYSCIFII 
GIjF VfJ Il'ALW V F£i CTTKKRTTVTI YMMNVALVDL I FIMTLPFRM 
FYYAKDEWPFGEYFCQILGA 


5667 


I 


695 


HPLPSASLGLPSVSLHVSLCVRSAIJ.EAWPMLPXRRRARVGSP 
SGDAASSTPPSTRFPGVAIYLVEPRMGRSRRAFLTGLARSKGFR 
VLDACSSEATHWMEETSAEEAVSWQERRMAAAPPGCTPPALLD 
ISWLTESLGAGQPVPVECRHRLEVAGPSKGPLSPAWMPAYACQR 
PTPLTHHNTGLSEALEILAEAAGFEGSEGRLLTFCRAASVLKAL 
PSEVTTLSQLQ 


56S8 


691 


894 


CSFLFCIPDLFLQFLLGRKEEEAVLVGGEWSPSLDGLDPQADPQ 
VLVRTAIRCAQAQTGIDLSGCTKK 


5669 


407 


1 


DSGAPEGLSELMSTQEGLSMHAHFQAYTPFIYLHARKRRGEIGD 
ADSRFNDRYAHKSAQLYFLYFVCWIFQDVYYFTIKEKNHFFFPK 
ARGAPTKYSGSPIGSPTTTPPTRPPSFNLHPAPHLIiASMQLQKL 
KSQ 


5670 


3 




SSECLTMAWIPLLLPLLILCTVSVASYELAQPSSVSVSPGQTAK 
I TCSGDVLAKKYARWFQQKPGQAPVLVI YKDTERPSGI PERFSG 
STSGTTVTLTISGAQVEDEADYFCYSATDNFLWVF 


5671 


280 


524 


KFPPKKTPPHLGMESAITLWQFLLQLLLDQKHEHLICWTSNDGE 
FICLLKAKKVAKLWGLRKNKTNMNYDKLSRALRLLFMT 


5672 


2 


557 


FVPATPDPGVWLPPSRDPAMAKRSSLYIRIVEGKIILPAKDITGS 
SDPYCIVKVDNEPIIRTATVWKTLCPFWGEEYQVHLPPTFHAVA 
FYVMDEDALSRDDVIGKVCLTRDTIASHPKGKFSLPSHTGLPSP 
WPPSHSETSPLGSVWSPAQGKPFLLSPEAGATFCTPGLCSAACS 
QAWLLLPLP 


5673 


327 


696 


I TVADQ I SHWSAGRI KNRTRI PECIHS SAATTLAGPHTMEGES V 
KLSSQTLIQAGDDEKNQRTITVNPAHMGKAFKVMWELRSKQLLC 
DVMIVAEDVEIEAHRWLAACSPYFCAMFTGDMS 


5674 


17 


984 


GGGSMEGSSTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGE 
AKNSITDSQMDDVEWYT1DIQKYIPCYQLFSFYNSSGEVNEQA 
LKKILSNVKKNWGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQ 
DIiVFDLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLWANLG 
MSEQLGYKTVSGSCMSTGFSRAVQTHSSKFFEEDGSLKEVHKIN 
EMYASLQEELKS I CKKVEESEQAVDKLVKDVNRLKREIEKRRGA 
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKID 
MFLKVAVTTTTISM 


5675 


BO 


753 


EGSRRC-PTRLARLSARAGRLHFPPGFSSRLIHFRGVSECRRPPG 
KSGVPVSAPGSDGKWWEERPGMFSLMASCCGWFKRWREPVRKVT 
LLMVGLDNAGKTATAKGIQGEYPEDVAPTVGFSKINLRQGKFEV 
TIFDLGGGIRIRGIWKNYYAESYGVIFWDSSDEERMEETKEAM 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 

amino acid 


Predicted end 

location 
corresponding 
to first 
amino acid 
residue of 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, T=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=?roline, Q=Glutamine , R=Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








SEMLRHPRISGKFILVLANKQDKEGALGEADVIECLSLEKLVNE 
HKCL 


5676 


2 


930 


FVSSPPPRPVQPARPGGFGLSCRRSLLCQVASTPAHVGVMRSPV 
RDLARNDGEESTERTPLLPGAPRAEAAPVCCSARYNLAILAFFG 
FFI VYALRVN ZjS VALVDMVDSNTTLEDNRTS KACPEHSAP I KVH 
HNQTGKKYQWDAETQGWI LGSFFYGYI I TQ I PGGYVAS KIGC-KM 
LLGFGILGTAVLTLFTPIAADLGVGPLIVLRALEGLGEGVTFPA 
MHAMWS S WAP PLERSKLLS I E YAGAQLGTVI SLPLSGI I CYYMN 
WTYVFYFFGTIGIFWFLLWIWLVSDTPQKHKRISHYEKEYILSS 
L 


5677 


1 


1028 


PPRDGFLELRRLSVPLCSGPCPLTSLSRQGERSGGHLVAAARAA 
VTAETHPLPLLAPLAVCQ3VKSPAACQVRPRPRAVALPAALGGP 
GRSLPGLTAATMSSFSESALEKKLSELSNSQQSVQTLSLWL1HH 
RKHAGPIVSVWHRELRKAKSNRKLTFLYLANDVIQNSKRKGPEF 
TREFE3VLVDAFSHVAREADEGCKKPLERLLNIWQERSVYGGEF 
IQQLKLSMEDSKSPPPKATEEKKSLKRTFQQ IQEEEDDDYPGS Y ■ 
SPQDP3AGPLLTEELIKALQDLENAASGDATVRQK1ASLPQEVQ 
DVSLLEKITDKEAAERLSXTVDEACLRNRGPGTS 




3 


593 


SSSPPSSTPSLPLPFYLLI.GQLRLQLLWGTAHLSGAGEAAPCPG 
GSGRTAAPRTRADPAAQS LMIMNKMKMFKRRFSLSVPRTETIEE 
SLAEFTEQFNQLHNRRNENLQLGPLGRDPPQECSTFSPTDSGEE 
PGQLSPGVQFQRRQNQRRFSMEVRASGALPRQVAGCTHKGVHRR 
AAALQPDFDVSKRLSLPMDI 


5679 


2 


623 


LKTSRVDDFVAVPGAIMDEDYYGSAAEWGDEADGGQQEDDSGEGE 
DDAEVQQECLIIKFSTRDYIMEPSIFNTLKRYFQAGGSPEWIQI, 
LS ENYTAVAQT VNLLAE WL IQTGVE PVQVQETVENHLKSLL I KH 
FDPRKADSIFTEEGETPAWLEQMIAHTTWRDLFYKLAEAHPDCL 
MLNFTVKVGRVLEDRRKVFMNVYFWLLVCFL 


5680 


258 


592 


RRLTSTSEKLQNRNSHTPLESLIHPQPSYKGFGIMFGKKKKKIE 
ISGPSNFEHRXHTGFDPQEQKFTGLPQQWHSLLADTANRPKPMV 
DPSCITPIQLAPMKTIVRGNKPC 


5S81 


45 


869 


LLCAKTLGVRTKE3QAEGYNRSGINNHQAEDPRFCPSFCWMRSA 
RQTRPQRLRKEAARPPTPGSCPGGTGMDGKKCSVWMFLPLVFTL 
FTSAGLWIVYFIAVEDDKrLPLNSAERKPGVKHAPYISIAGDDP 
PASCVFSQVMNMAAFLALWAVLRFIQLKPKVLNPWLNISGLVA 
LCLASFGMTLLGWFQLTNDEE1HNVGTSLTFGFGTLTCWIQAAL 
TLKVNIKNEGRRVGIPRVILSASITLCVGPLLHPHGPKHPHVCS 
QGPVGPGHVL 


5682 


39 


622 


PSRSCLGTMRKWRHREVNLPEVTQQDAVCPAPIPSPGLSAQTGL 
QKIWGTIHCQVCPGAPAWPGSPWHEEMGLLLLVPLLLLPGSYGL 
PFYNGFYYSNSANDQNLGMGHGKDLLNGVKLWETPEETLFTYQ 
GASVILPCRYRYEPALVSPRRVRVKWWKLSENGAPEKDVLVAIG 
LRHRSFGDYQGRVHLRQD 


5683 


89 


778 


GSCGATALITRCLAWSVLISRLAMATYTCITCRVAFRDADMQRA 
HYKTDWHRYNLRRKVASMAPVTAEGFQERVRAQRAVAEEESKGS 
ATYCTVCS KKFAS FNAYENHLKSRRHVELEKKAVQAVNRKVEMM 
NEKOTjEKGLGVDSVDKDAMNAAIQQAIKAQPSMSPKKAPPAPAK 
E ARNWAVGTGGRGTHDRD PSEKP PRLQWFECQAKKLAKHS EDD 


5684 


195 


577 


TWCFRGYLGPRVIMKALDEPPYLTVGTDVSAKYRGAFCEAKIKT 
AKRLVKVKVTFRHDSSTVEVQDDHIKGPLKVGAIVEVKNLDGAY 
QEAVINKLTDASWYTWFDDGDEKTLRRSSLCLKGERHFAESET 
LDQLPLTNPEHFGTPVIGKKTKRGRRYE 


5685 


779 


1262 


LLLQQPWHCFLLFPPFRFSHHMIPGPPGPHTTGIPHPAIVTPQ 
VKQEHPHTDSDLMHVKPQHEQRKEQEPKRPHIKKPLNAFMLYMK 
EMRANWAECTLKESAAINQILGRRWHALSREEQAKYYELARKE 
RQLHMQLYPGWSARDNYVSPSSIPVALHS 


5686 


128 




. . uMTLLDINDNHPTWKDAPYYINLVEMTPPDSDVTTWA 
VDPDLGENGTLVYSIQPPNKFYSLNSTTGKIRTTHAMLDREWPD 
n-EA3LHRKIWSVTDCGRPPLKATSSATVFVNLLDLNDNDPTF 
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ID 
NO: 
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location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
location 

amino acid 


Amino ac^d segment containing signal peptide 
(A=Alanine, C=Cysteir.e, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








QNLP F i/AEVLEGIPAGVS I YQWAI DLDEGLNGLVS YRMPVGMP 
RMDFLINSSSGVWTTTELDRERIAEYQLRWASDAGTPTKSST 
£TLTIHVLDVMDETPTFF?AVYNVSVSEDVPR\G3GWSG»AARN 
NDVGLNAELSYFITGGNVDGKFSVGYRDAWRTWGLDRETTAA 
YMLIL3AIDNGPVGKRHTGTATVFVTVLBVNDKRPIILQSSYV 


5687 


17 


917 


AAPPAPPEG/PPP/PPPAPPT/PGPAA/APASSCQPRLSAGRAA 
QGDGGAAAVGHVLWPAVGPVRVNPGLQTPVPRPELLPGP\SSS 
LHSDSSYPPDAGLSDDEEPPDASLPPDPPPLTVP/ADA/PMPVT 
SGCRMPSTSASE/AAGGQGACTHAKGSETPPPASPQTSEPAPSP 
LPPHLTGGPGMYSSEAKLPNSFSCLGLAGTGAGI*GTASAHGTG 
PPVLPHVCTPSLANPQP\AVGPEASSLPLGVSGIGMSA/SAPIS 
SSPFVAIGSCWLRGIPPPGSGFLCPGRAPGPVPITTHGQEGQGP 
VLDI 


5638 






*SHMIIRSLL*KVIDDSLGQLPLLRELLL**LNVIDRCIILAYV 
LRVEKTFAITYLKNFTVKVDFSLLGEIPLISMAAHjKLWIMKID 
DGYIPAVF 


5S89 


1504 


3 


ITSKDRGE I PAL 

IWA/RKPIGTWTATKPTHRAG*GGAEEYQPPPQPCEGPRSTSRG 
GEG*GHAVGPGREIGKEGSLPFLGPKALGF*SASCQRAFEGGAH 
GSTARKPAPATPGTRHPRTMETREVAQGWPAGPRSQFWDQHPHS 
PGEHRPSG\SPLPACPPRAWPKAGAVASATGTG\PQLPGSRGKQ 
KLPRTREPPLLQAGWAVRKPPWSEAKEGLGQAGRPSGMDSSAS\ 
PQTPGGRGSLEWGLPLYLGPHHDVK*RSDRLG*PP*GGQGGGGH 
E i tGP JGEAW + LPOQTSRPKPGPQAY*GE\GSPGLQCPCSK 
3L*RVPPGSLGPSTQCMYEPTDKHS\GGADAQLEVSTAGSRSTF 
GQELKGPLDAGRLWPGAPSAS S SHR* GG * ERARAGAGHRGST * A 
SSKIEQGRPRPGPTSDALADVEGGAES/GPHPWPLPGTLPNR/P 
GSPPPA*ASAGRKGTVSTLGGGLL 


5690 


1424 


58 


PSPPAGVCAAPAPLPLLALARRDRRPCSPGAEAAPWQTGGPAID 
GAWRTSVS.ALRRGATG/APCSPGAEAAPWQTGGPAIDG\DGELP 
*VRSEEAPRGCGAEGGGPGSGPVRRPGAGRGAHAGQGRQQDPEP 
DGLRHRQHGAASHARHRLQRLRPGHHQNRHVRRDPQAPPGGPAP 
GHAAALPERTRGVAEPPAKAHAGSDAWRAGR*SQRT*ERARPRH 
PTFQGRAGS\GQPGYQPPNPHPGPSSPPAAP\GPRGA*GNPQLE 
KAPRSDRNPSQGLRTRIRRPETPDCGPPSPAGSSASASTFRCTS 
S L3 LLGP / PGAHNLDTAPQDR* HGP *C-DKRGAPGVAGEDPRPP * 
GNFVR * LLLMP/GVA* RHGTS PFLGPSLGENGGQKDSGNLFGTP 
KG*SHPAFTKST*SMEAEKSYWNIIPIIR\DRGRQGVRINCLRVGE 
SEMWGPYSAPRPGTVFLSSFLSPASEEH\PEGSSSFHTPFPPAG 
PEGDPGLNSPGLLP 


5691 


107 


550 


-SNDPSPGYNIEQMAKRGKKLVELPYTVKGMDVSFSGILSFIED " 
VAHRMLATGECTPEDLCFSLQVMQ*KTGTESWG*RFYIVEQN*S 
GDAPLIFSPYLSLTGNCGFAMLVE ITERAMAH\ CGSPGGPSLWG 
GVGVYVLLESVPLSYS 


5692 • 


1193 


548 


TQAWTRAEKDRKGSVRALRLHLERGPPT*RGSHPL\QSVPCIQK 
PSIFSSYPI/GLPQSGGEPGPVGEQQPVRRPEQPSCGPASRMPL 
TSRSVPPGRGALPPDSLSTRKGLPRPSTAGHRVRESGHKVPVSQ 
RLNLPVMGATRSNLQPPRKVAVPGPTR*RDQDSKQDFSSKPLCS 
VPGIASTQQTLTPADSGPGTGGRDATRAGLPGVETMGNGVD 


5693 


1258 


1330 


ALTVVPVRKGTTWKAQPHGCSNLVSRARLDLSSRPSQNTEPQAP 
*QAGPPSSLRPP\SRRR*APEWPKRATGSRCRGLSAPPWPWPAA 
RGE/PGSAPSHAP/PNSPRPSGTRHP/PGPSSRVLYSPSLPRNS 
PEAIVWRSSRFPLKFPLRCCFWVSGFKDPNPVLRFF 


5694 




1338 


GS KE PARS LHR.-iGSGHKS SAG KKGSVTLS TAGALG* KQLHQ *WT 
QRCL\NNLSSEEFNASSSLNSLPSTPTASRRNSTIVLRTDSEKR 
SIAESGLSWFSESEEKAPKKLEYDSGSLKMEPGTSKWRRERPES 
CDDSSKGGELKXPISLGHPGSLKKGKTPPVAVTSPITHTAQSAL 
KVAGKPEGKATDKGKLAVKNTGLQRSSSDAGRDRLSDAKKPPSG 
IARPSTSGSFGYKKPPPATGTATVMQTGGSATLSKIQKSSGIPV 
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SEQ 
ID 


Predicted 
beginning 

amino acid 
residue of 

sequence 


Predicted end 
nucleotide 

corresponding 
to first 
amino acid 
residue of 

sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Onknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








KPVNGRKTSLDVSNSAEPGFLAPGARSNIQYRSLPRPAKSSSMS 
1 3 ^UGPRPVSSSIDPSLLSTKQGGLTPSRIKEPTKVASGRTT 
PAPVNQTDREKEKAKAKAVALDSDNISLKSIGSPESTPKNQASII 
PTATKLAELPPTPLRATAKS FVKP PSLANLDKVNSNS LDLPS S S 
DTTQCI 


5695 


3 


1330 


GSKEPARSLHRRGSGHKSSAGKWGSVTLSTAGALG*KQLHQ*WT 
QRCL\NNLSSHEFNASSSLNSLPSTPTASRRNSTIVLRTDSEKR 
SLAES3LSWFSESEEKAPKKLEYDSGSLKMEPGTSKWRRERPES 
CDDSSKGGELXKPISLGHPGSLKKGCTPPVAVTSPITHTAQSAL 
KVAGKPEGKATDKGKLAVKNTGLQRSSSDAGRDRLSDAKKPPSG 
IARPSTSGSFGYKKPPPATGTATVMQTGGSATLSKIQKSSGIPV 
KPVNGRKTSLDVSNSAEPGFLAPGAR3NIQYRSLPRPAKSSSMS 
VTGGRGGPRPVSSSIDPSLLSTKQGGLTPSRLKEPTKVASGRTT 
PAPVNQTDREKEKAKAKAVALDSDNISLKSIGSPESTPKNQASH 
PTATKLAELPPTPLRATAKSFVKPPSLAKLDKVNSNSLDLPSSS 
DTTQCI 


5696 


3 


1338 


GSKE?A*SLHR.-< iSCHI 3SAGKWGSVTLSTAGALG*KQLHQ*WT 
QRCL\NNLSSEEFNASS£LNSLPSTPTASRRNSTIVLRTDSEKR 
SIAESCLSWFSE5EEKAPKKLEYDSGSLKMEPGTSKWRRERPES 
CDDSSXGGELKKPISLGHPGSLKKGKTPPVAVTSPITHTAQSAL 
KVAGKPEGKATDK3KLAVKNTGLQRSSSDAGRDRLSDAKKPPSG 
IARPSTSGSFGYKKPPPATGTATVMQTGGSATLSKIQKSSGIPV 
KPVNGRKTSLDVSNSAEPGFLAPGARSNIQYRSLPRPAKSSSMS 
VTGGRGGPRPVSSSIDPSLLSTKQGGLTPSRLKEPTKVASGRTT 
PAPVNQTDREKEKAKAKAVALDSDNISLKSIGSPESTPKNQASH 
PTATKLAELPPTPLRATAKS FVKPPSLAIJLDKVNGNQLDLPSSS 
DTTQCI 


5697 


1147 


47 


PSEALSPPACPSAPAFRRSIISRLFGTSPATEAAPPPPEPVPAA 
2GPATVQSVEDFVPDDRLDRSFLEDTTPARDEKKVGAKAAQQDS 
DSDGEALGGNPMVAGFQDDVDLEDQPRGSPPLPAGPVPSQDITL 
SSEEEAEVAAPTKGPAPAPQQCSEPETKWS5IPASKPRRGTAPT 
RTAAPPWPGGVSVRTGPEKRSSTRPPAEMEPGKGEQASSSESDP 
EGPIAAQMLSFVMDDPDFESEGSDTQRRADDFPVRDDPSDVTDE 
DEGPAEPPPPPKLPLPAFRLKNDSDLFGLGLEEAGPKESSEEGK 
3GKTPSKENKKKKKKGKEEEEKAAKKKSKHKKSKDKEEGKEERR 
RRQQRPPRSRERTAA 


5698 


2 


S66 


GAEAAEPQEDLPPLSQSSRFFQEQQKMNKSLGPVSFKDVAVDFT 
QEEKQQLDPEQKITYRDVMLENYSNLVSVGYHIIKPDVISKLEQ 
GEEPWIVEGEFLLQSYPDEVWQTDDLIERIQEEENKPS.RQTVFI 
ETLI*R/ERGNVPGNTFDVETNPVPSRKIAYTHSLCNSCER\GF 
NASSEYISSDGRYARMKADECSGCGKSLLHIKLEKTHPGDQAYE 
FNQ 


5699 


2 


1448 


RVRQPPGLWVRRTVPAMQCPAGLSRVPGVAG/DPSLPSFRGPRD 
EAAHRGTIQTARHTRKLYVQGPASGPPLPRVSTQVAI*DEKPLA 
RPS/GRTNAPFPQGQKPAGKAAPGPAAAGRVAMR\PGHPGLLAS 
DSQRSSSKGSGWETPVPWS*AQPGWV£GLLLLGDPSGPGSL*RS 
TWLVGC-ARGPEGSGVRGSGWPSGCSDIGWALAGWNHS*HLDPNT 
WTQKWTGE/SPAPGEEG\VAPAPRGPTAEHGHCELTTESQYSNN 
VPILFQNPSGALRSRRTEPAGWVPPTRHE*DDG*TAAPASGGAP 
VSTPTKAGTP/LNASLGPTDPQGKPGCRPPCALPKPAGPERSA* 
GGSLGCR/SMLPASSGPPPAPGPRRLAAGAHTSASARCPPAAAA 
GWQPRRPGFAGRAALPGPPHPPSS*RELGGLPGPGW*TLDPLPA 
HPAHPPGSAPPWGALGGWAAARASLPWSPSLCLSFPAVTPVAGL 
FPPGRG 


5700 


923 


597 


3NIHKNS KS2SHLNQDHS FPPPTPNSARS 
KLESTGTAKNTGLPLSGAPRQRAVFSGRTICQEFSSCLQCAYLD 
E*CSIASSLIKAILRVSVLSE 


5701 


59 


410 


1" ^ I i 1 PEINPQICSWLIFDKGAK/NHATGKDSLFN 
KWSWKNWLSTCR*MRPGPYFTPYTKINSK*IK/DANIRCETVKL 
L3ENT3ENLHDTGLGHVFLDMTPKTQPTKQK 
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Predicted 
Deginning 



corresponding 
first 

residue of 

sequence 



redicted end 
icleotide 

corresponding 
i first 
lino acid 
■.sidue of 



Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Asparric Acid, B= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=3top 
Codon, /=pos£ible nucleotide deletion, 
\=possible nucleotide insertion) 



ETFVDPSQCGGIPSDSPHPVITPSRASESSASSDGPHPVITPSR 
ASESSASSDGPHPVITPSRASESSASSDGLHPVITPSRASESSA 
SSDGPHPVITPSRASESSASSDGPHPVTTPSRASESSASSDGLH 
PVITPSRASESSASSDGPHPVITPSWSPGSDVTLLAEALVTVTN 
IEVIMCS I TE I ETTTSS I PGASDTDLIPTEGVKASSTSDPPALP 
DSTEAKPHITEVTASAETLSTAGTTESAAPHATVGTPLPTNSAT 
EREVTAPGATTLSGALVTVSKNPLEETSALSVETPSYVKVSGAA 
PVSIEAGSAVGKTTSFAGSSASSYSPSEAALKNFTPSETLTMDI 
TTKGPFPTSRDPLPSVPPTTTMSSRGTNSTLAKITTSAKTTMKP 
PTATPTTARTRPTT\A*VQVKMEVSSSCG*VWLPRKTSLTPEWQ 
KG* CS SS TGNS TPTRLTS RS P YCVSGEANG / PSAAARHVP YAKR 
GCCP*PGPPPTDCSCVTVLRGTQKVPMKGSMSKPLTPDVATGPS 
LTSTGVYVWGGASPVPRGVLC-LTLAHVLCFSKEKT 



HHKDSRSQGLPRTQECARPELRPLLCPRALVJPVTRLSYRCPWQA 
P1CAGIGTKAKPSESHLKLHPGWPSLDRQGEPATLGTGTGHCSDS 
RILRWHP*HTAAR*PRWRRLPSSHRWTRHLGVLRVQDKS**VSL 
DPSCRPRFLRTC**YGMRSVASSSNPPPGWSGPGASVFPARPVS 
ALPTGPRCW*APRGRTRQPCGWPRLS3PHATADWGPGCPLSPSR 
GSWETAPGS*WCPWL*AARWTGWRTASGASAGLGRAADRPSAWA 
RRVAGLLPGQGLTVRR * H * TAGAPAS VR£ S QGATRS FAPGGDQ C 
ACGRGPGSC*HPPPWPVS?SSPVPCPSGR*HLRGPLLSAARPRA 
i PRH "IIDTQTPEP 



GDYEFDS5YWDDISQAAKDLVTRLMEVK0DQRITAEEAISHEWI 
SGNAASDKN I KDGVCAQ I E KNFARAKWKKAVRVTTLMKRLRAPE 
QSSTAAAQSASATDTATPGAAGGATAAAASGATSAPEGDAARAA 
KSDNVAPRRP*LPPQPQMEVPPQPLMAVSPQPPM2ASLQPLMGE 
SPQP 



GDYEFDSPYWDDISQAAKDLVTRLMEVEQDQRITAEEAISHEWI 
SGNAASDKNIKDGVCAQIEKNFARAKWKKAVRVTTLMKRLRAPE 
QS S TAAAQSASATDTATPGAAGGATAAAASGATSAPEGDAARAA 
KSDNVAPRRP+LPPQPQMEVPPQPLMAVSPQPPMEASLQPLMGE 

SPQP 

QLGRFXAQDTVAIRKVKEVFGTGAMRHWILFTHKED*GGQALD 
DYVANTDNCSLKDLVRECERRYCAFNNWGSVEEQRQQQAELLAV 
IERLGREREGSFHSNDLFLDAQLLQRTGAGACQEDYRQYQAKVE 
WQVEKHKQELRENESNWAYKALLRVKHLMLLHYEIE 
LFFIIFLF 

GSPAPTPGPRRRPGRGTPSPGTRHHQGRAEPKPDAPERAPLRR* 
MFAIQPGLAEGGQFLGDPPPGLCQPELQPDSKSNFMASAKDANE 
NWHGMPGRVEPILRRSSSESPSDKQAFQAPGSPEEGVRSPPEGA 
EIFGAEPEKMGGAGTVCSPLEDNGYASSSLSIDSRSSSPEPACG 

TPRGPGPPDPLLPSVAQA 

SFSWEETISPCFPKMPAEPWWkSPVSLGAAGKPGQPRPYLDLPA 
QASVSRPHDRA*G3AVSLSLSSGDVCGHTDGGGAGSDPQAKPKP 
PRCPFTAMPSPRTKQKVRWKVCLLIAIRYSDIPSDVSKAP\GPA 
GNPHDRSSTAA*LHRRAGAGSLCLSASLLPPSFSLGAPGAPSPL 
RVSPASGGPRKEGRQGSGG*AGGGGP\ARTHADLPCVGFVCSPP 
LLK*SDSPVKQLPA\SGQGSGAGMPPVGSSDILRPRPTSVSGTG 
RAAG*CSWQPAACCTPRSQ*WAVARSPSRCSRW*RQSGR*RG*S 
SRRRRGP*AAGRSTPAVP*PCS*GGAGRRAYACRTGWGYAPSR* 
LEPSGPTSG5AIi*TWASHSTGA**SRLCGTAGTGPLCSQSSRS* 
AG*RCCCTAASPCGGSGPSHPGSPSAHCLSWSGGRTQPRAPSAH 
GRGRAMGSRCVCTCTGLPCPGIPLSGASPGGSGETGAGRSHTLK 
AARSRLS PRPGSGSRGSY* SHNDNWGTWPAPPSAGHLLVGG*IIS 
QRTSSDH* YTGTRRPWAGPGTRCSTAPSRAAPPVSRCRPPPPPP 
PPRPPRLPAAAS /SGGASGS PAASCS CS CRAPAKPAS S / GEAPA 
PPPRPEPPPPPARRP 

ITLCPLPQTEKCLIWVTEAATPLGIYLKARVEAGGLKELEISWG 
LHQIWRWGAWMRAGMGGCRCWGVMAPFAPR/NALSFLVNDCS 
LIHWJTVCMAAVFVDRAGEWKLGGLDYMYSAQGNGGGPPRKGIPE 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 

residue of 
amino acid 


Predicted end 

location 
corresponding 
to first 


hra n > acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenyl al anir.e , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L= Leucine, M=Methionine, NaAsparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LI X J-PELADSS3RWREKRSADMWRLGCLIWEVFNGPLPRAA 
ALRNPGKIPKTLVPHYCELVGANPKVRPNPARFLQNCRAPGGFM 
SNRFVF.TNLFLEEIQIKEPAEKQKFFQELSKSLDAFPEDFCRHK 
VLPQLLTAFEFGNAGAWLTPLFKVGKFLSAEEYQQKIIPVWK 
MFSSTDRAMRIRLLQQMEQFIQYLDEPTVNTQIFPHWHGFLDT 
NPAIREQTVKSMLLLAPKLNEANLNVELMKHFARLQAKDEQGPI 
RCNTTVCLGKIGSYLSASTRHRVLTSAFSRATRDPFAPSRVAGV 
LGFAATHNLYSMNDCAQKILPVLCGLTVDPEKSVRDQAFKAIRS. 
FLSKLESVSEDPTQLEEVEKDVHAASSPGMGGAAASWAGWAVTG 
VSSLTSKLIRSHPTTAPTETNIPQRPTPEGVPAPAPTPVPATPT 
TSGHWETQEEDKDTAEDSSTADRWDDEDWGSLEQEAESVLAQQD 
DWSTGGQVSRASQVS\TPTTNPPNPQSPTGAAGK\RGLLGTGLA 
GAKLPGATS * RYTAGQRV 


5710 


1 




IPGSTISCEVELR^RMAKTIDSFTONQTRLWIIDGLDACEQDK 
VLQMLDTVRVLFSKGPFIAIFASDPHIIIKAINQNLNSVPSGFK 
\LKGHDYMRNIVHLPVFLIJSR3L/RQ/LQENFS*LQQQMETFHA 
QILQGYRKMLTEEFHRTALGR*QKLVARQPSIDG*DAIGFELYV 
CIAIQFNTNKDDAT 


5711 


1526 


1130 


K?F I IDVAFTt ..FYPDGAQPFIVKGESS 
SQIAKAVLSQQRPSLFHECAFHFFS * SLQRHT IMLDQ3I F * LLM 
LSEERQHLFES S / 1 WTTPHNLK* / FE IHEHLGSIIEGHWTLFFLL 

QIL 


5712 


3 


1391 


GRKLFQSIjniSERLKFI.T/rt.UHVnnTT.IVrjAEEHGCLDIIKELP 
ETVIDLLNKCLTFHPSKRPTPDELMKDKVFSEVSPLYIPFTKPA 
SLFSSSLRCADLTLPEniSQLCKDTNNDYLAERSIEEVYYLWCL 
AGGDLEKELVNKEIIRSKPPICTLPNFLFEDGEGFGQGRDRSS/ 
TFR*YHWDIWMPAKK*IERCWGRSILPITLKMTSLILPYSNSN 
NELSAAATLPLIIREKDTEYQLNRIILFDRLLKAYPYKKNQIWK 
EARVDIPPLMRGLTWAALLGVEGAIHAKYDAIDKDTPIPTDRQI 
EVDIPRCHQYDELLSSPEGHAKFRRVLKAWWSHPDLVYWQGLD 
SLCAPFLYLNFNNEALVYACMSAFIPKYLYNFFLfZDNSHVIQEY 
LTVFEQMIAFHDPELSNHLNEIGFIPDLYAIPWFLTMFTHVFPL 
HKIFHLW \ DTLLLGEFLFP I LYWE 


5713 


634 


234 


PVCAVPVDRWPVLPREDQEGQQI^AKLPRDFRR+FQILGPMEGH 
TACRCSRRGAQVQHLPREDIRAAE*DPHLREVWPGLPTSSATSP 
* RAVLTS P CSHLGS ADAAS SHWLCGVS FH 


5714 


212 


613 


WGLGLGPTMSSLGGGSQDAGGSSSSSTWGSGGSGSSGPKAGAAD 
KSAWAAAAPASVADDl'PPPERRNKSGIISEPLNKSLRRSRPLS 
HYSSFGSSGGSGGGSMMGGESADKATAAAAAASLLAKGHDLAAA 
MA 


571S 


131 


1379 


ESASQQKRSKCLILTLKLELSGSAPKKTGARPGGSLWLPPHSQE 
QTPPASKLQG3GGGLQTGWGLHPVPVTAASPLPRWCLFGAVAK\ 
GLP3P*LCPSGAA/GGLQRGPGLSPLGAAGKVSCLHPPSMVENN 
DSTCHEHHEGILAARVTPVP\SGKPGRVLKPPGRVCRPPHPAAS 
PRPPGE / SDLDGPRPQMHLRAFPAAHGGPVNTPHGGEEKTFMSS 
QIRRKETKPL*RKTPAG\NKYQSNSIPVSQSPQLTVDLLPSAGR 
TQAPSGRGDAGKPTPGHG\LPKASVILTPNCPCSLAGGQ*PPGL 
YPXTPKQRRWRRPL/LLGPSQ*GSRQGTC*EV\GALGEPVRIPG 
L*PDLSCILSNGSKHRREGLSFPRSLGPGRRGPAGLQSLGCSPT 
PKNTACHSSGHVALQAGHDSARDVGSGHVALQAGHDSTQDVGRP 
WRWIPLE*LGLSRETGQATRRGLVWISPGRAAAACVACAQALE 
EGPLRLPGQDRGAQPCSHCPGRAAGQPEPGAGAPCRF,/GG*DPT 
GLT/GVPGTDPKRGGRKPGQSGQETQGPTVWSGPESPLQPKP*E 
RQE/VGAGASS3VGLSRGRAGGPSSAWEVAAMLLLLRHGSHSEL 
TDLTEAQTSQH 


5716 


1711 


1370 


F.VFSLLCEGPGHCYQGAVCREACAAASPGLDSAAEPHRLCEHTD 
*LPK*GPGYIQHFHCDSNILCILYNISFNliFSYSF*GVARYAC* 
RCPLVL*SGFFTIIVGGYSCCMPLKT 


5717 




14BS 


LPTEALRESEWVSEYGKCGPRGLVPEGESTSPLPSSVDTEDSLD 
EGPGALVLEGDLLLGQDLEFEEEEEEEEGDGNSDQLMGFERDSE 
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ID 
NO: 


Predicted 

nucleotide 

corresponding 
to first 

residu^of 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 

amino^acid 


Amine acid segment containing signal peptide 
<A=Alanine, C=Cysteine, D=Aspartic Acid, B= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y=Tyxosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








GDSLGARPGLPYGLSDDESGGGRALSAESEVEEPARGPGEARGE 
RPGPACQLCGGPTGEGPCCGAGGPGGGPLLPPRLLYSCRLCTFV 
SHYSSHLKRHMQTHSCEKPFRCGRCPYASAQLVNLTRHTRTHTG 
EXPYRCPHCPFACSSLGNLRRHQRTHAGPPTPPCPTCGFRCCTP 
RPARPPSPTEQEGAVPRRPEDALLLPDLSLHVPPGGASFLPDCG 
Q\CGVKGRASAGLDQNHCQS/SLFPWTCRGCGQELEEGEGSRLG 
AAMCGRCMRGEAGGGASGGP0GPSDK3FACSLCPFATHYPNHLA 
RHMKTHSGEKPFRCARCPYAEAHLDNLKRHQRVHTGEKPYKCPL 
CPYAG3NLANLKRHGRIHSGEKPFRCSLCNYSCNQSMNLIRHM 


5718 


120 


284 


VAHALSLPAESYGNDVSMTHPQLPPTQLAWDLCRTCLPLSYNFT 
S**STADPLHL 


5719 


48 


428 


ELNNGPFQMPLCNGGNLAVTGSWADRSPLHEAASQGRLLALRTL 
LSQGYNVKAVTLDHVTPLHEACLGDHVACARTLLEAGANVNAIT 
IDGVTPLFNACSQGSPSCAELLLEYGAKAQP\ESGLPSP 


5720 




1051 


LQAFRMASEVPMVLVGTQDAISAA\NPRVYRRTSRARKLSTDLK 
\RCT\YYE\TCGGTYGLQHWSVSFQDVAQKWAL\RKKQQ\LAI 
GPCK\3LPN\SPSH\SAVSAASIPARAPINQGHE/SGGGSAFSD 
Y\SSSVPSTPSISQRELRIETIAASSTPTP1RKQ3KRRSNIFTS 
RKGADPXDREKKAAGCKVDSIGSGRAIPIKQGIJjLKRSGKSLNK 

ewkkkyvtlcdwglltyhpslhdymqmihgkeidllrttvkvpg 
krlpratpatapgtsprawglsversntqlgggtgaphsassas 
lhser?lsssawagprpegt,hqrscsvssadqwseattslppgm 

QKPASG 


5721 




492 


RKS PC RLRRTERSSNAAVST/TTVQQFKRFIENYRKHIGCVA 
'1 i U 'JGLFLERAYYYAFAAHHTGITDTTRVGIILiSRGTAASI 
SFMFSYrLLTMCRMLITFLRETFLNRYVPFDAAVDt'Hl^lA.STA 


5722 


88 


1043 


VALDVLAGSSPGGGMAGALLGPRVHGIRAVLRVARGGVQAPGAP 
GSLGVSHAAAPPARPCGAAQSPHRGRRHGGGGAGLPPPRSPRFP 
QESVPASTSTARGPRRVSRRLPPQHPGPRGRRRRPGAGVGAPRR 
GRARGQAGLLGRQGQGGRGAERERAALQARRGRRPGPEPDQSCG 
GRPRRAAAAPGRAPADPQPPAPRPAPAPDVRPPADAPAPAPAPA 
PPPPPHLGALTAGSGEERQSQPRAETLRL3RGAPLP\PRAERGG 
RPKQAEQQQ\PKRPTPPARGPQSSGDPAMLPQRAGLRTGGLAGT 
KSSTREIPEMI 


5723 


88 


1043 


VALDVLAGSSPGGGMAGALLGPRVHGIRAVLRVARGGVQAPGAP 
GS1GVSHAAAPPARPQGAAQSPHRGRRHGGGGAGLPPPRSPRFP 
QESVPASTSTARGPRRVSRRLPPQHPGPRGRRRRPGAGVGAPRR 
GRARGQAGLLGRQGQGGRGAERERAALQARRGRRPGPEPEQSCG 
GRPRRAAAAPGRAPADPQPPAPRPAPAPDVRPPADAPAPAPAPA 
??PPPHLGALTAGSGEERQSQ?RAETLRLGRGAPLP\PRAERGG 
RPKQAEQQQ\PKRPTPPARGPQSSGDPAMLPQRAGLRTGGLAGT 
KSSTREIPEMI 


5724 


3 


1841 


JTNEAPPAPLPDASASPLSPHRRAKSLDRRSTEPSVTPDLLNFK 
KGWLTEQYEDGQWKKHWFALADQSLRYYRDSVAEEAADLDGEID 
LSACYDVTEYPVQRNYGFQIHTKEGEFTLSAMTSGIRRNWIQTI 
P4KHVHPTTAPDVTSSLPEEKNKSSCSFETCPRPTEKQEAELGEP 
DPEQKRSRARE\RRREGRSKTFDWAEFRPIQQALAQERVGGVGP 
ADTH\DPWRPEAEHGELERERARRREERRKRFGMLDATDGPGTE 
DAALRMEVDRSPGLPMSDLKTHNVHVEIEQRWHQVETTPLREEK 
QVPIAPVHLSSEDGGDRLSTHELTSLLEKELEQSQKEASDLLEQ 
nELLCDQLRVALGREQSAREGYVLQATCERGFAAMEETHQKKIE 
J LQ1 . -QRELEKLREEKDRLLAEETAATISAIEAKKNAHREEME 
RELEKSQRSQISSVNSDVEALRRQYLEELQSVQRELEVLSEQYS 
'.'1 1 ' L, JKAHLAQALEAERQALRQCQRENQELlJAHNCELNNRIiAAE 
ITRLRTLLTGD3GGEATGSPLAQGKDAYELEVPSGARPCLTQLC 
TQEPQGSAAWPLSYSWGGTDLRQQESQGPGRSKSPEGGEEQ 


5725 


3 


1049 


\m --eetsq3pnrtephdsdcsvdlgiskst3dlspqksgpvg 
swkshsitnmeigglkiydilsdnVdlsshlqplk/ftsaveg 
knivrskaatllydqplqvftgsssssdlisgtkaifkfdsnhn 
pe/gakynkrphkwahnlhlkymvlhsiisntvav\rsqrhfva 
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SEQ 
ID 
NO: 


Predicted 
nucleotide 
corresponding 

residue°of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
tA=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LQTKSPNRPCQFSSSAPS/VDQRAQ/ INQSYAKHSANMNFSNHN 
NVRANTAYHLHQRLGPARHGEKWAI S PNDRLI PAVTRS TIQRQS 
SVSSTASVNLGDPGSTRRAQIFEGDYLSYREFHSAGRTPPMMPG 
SQRPLSART YS I DGPNASRPQSARPS INE I PERTMSVSDFNYSR 
TSP 


5726 


2 




SRSLSMWWNSGLPASSHSSKLPVTVGFSGCVKRLRLHGRPLGAP 
TRMAGVTPCILGPLEAGLFFPGSGGVITL/ESVGAGIPGPSRAG 
QGSPGGSGEGPPLSSPSQPLPADLPGATLPDVGLELEVRPLAVT 
GLIFHLGQARTPPYLQLQVTEKQVIjLRADDG 


5727 


21 


221 


RPILILKETRRLPWATGYA3VINAGKSTHNEDQASCEVLTVKKK 
AGAVTSTPNRNSSKRRSSLPNGE 


5728 


2 


877 


GTRNGQFEPRRGRAWEGSAGGLRAPGAAAGGPGVCjPRGSG/LPG 
NAIRAGVNPGRGPASPFWDLSLPWDLWPPPTDHAPGAPDFPAVE 
GR\PWAGGRPPWPVSGVLGSRVCGPLYSTSPAGPG/SGGLSPSQ 
GGPAGAGGDAG/liPGRCPSAPWRAGSRPAASCPDWIPGPQGLWL 
HRNPTS/GPPSQIGEGAEQGDEGVADAPQIQCKN/GAEDPPAED 
EPPQVPEAGEEDAVPAEEGPGGTPETQADQVRERPEAHLAEGGA 
KGSPRRLADPQDLPAGOMSLAPPFPPVAAV1ESNK 


5729 


1 




AGGARKVLTLQl.GHFAGFVGAHWWNQQDAALGRATDSKEPPGKL 
CPDVLYRTGRTLHGQETYTPRLILMDLKGSLSSLKEEGGLYRDK 
QLDAAIAWQGKLTTHKEELYPKNPYLQDFLSAEGVLSSDGVWRV 
KSIPNGKGSSPLPTATTPKPLIPTEASIRVWSDFLRVHLHPRSI 
CMIQKYNHDGEAGRLEAFGQGESVLKEPKYQEELEDRLHFYVEE 
CDYLQGFQILCDLHDGFSGVGAKAAELLQDEYSGRGIITWGLLP 
GPYHRGEAQRHZYRLLNTAFGLVHLTAHSSLVCPLSLGGSLGLR 
PEPPVSFPYLHYDATLPFHCSAILATALDTVTCS\YRLCSSPVS 
MVHL\ADMLSFCGKKVVTAGAIIPFPLAPGQSLPDSLMQFGGAT 
PWTPLSACGEPSGTRCFAQSWLRGIDRACHTSQLTPGTPPPSA 
LHACTTGEEILAQYLQQQQPGVMSSSHLLLTPCRVAPPYPHLFS 
SCSPPGMVLDGSPKGAAVESVPVFG 


5730 


1258 


1713 


KKFQAP ARETC VE CQ KTVYPME RLLANQQVFH ISC FRCS YCNNK 
LSLGTYASLHGRIYCKPHFHQLFKSKGNYDEGFGHRPHKDLWAT 
KIETEGFWERPRNFSNQGRPLKSPGGEDCPSC*GGCPGSNY*AQ 
GSSSREKGGQASWNPK1RVA 


5731 


122 


443 


RSHRGELIPKDSCYMRKPPRRPKKRRQG/CALPQGCLTFKDVAI 
EFSLEEWKCLNPAQRALYRAVMLEKYRNLESVGLTSKDSWYMRK 
KPGRGRGKQRRQEWFFLRVY 


5732 


22S 


772 


PPSRSCQSPRRKSRRRAHVTVTLVCGFTSFSFSLPLYLCGCLRF 
PERTCSQLQQADWA?DFGPSSFVPSWGATATGARKFLIAFNI\N 
LLGTKEQAHRIALKLREQGRGKDQPGRLKKVQGIGWYLDEKNLA 
QVSTNLLDFEVTALHTVYEETCREAQELSLPWGSQLVGLVPLK 
ALLDAA 


5733 


1 


4G0 


PALQ3VKIAKTALAWGKQYENDARTLFEFTSGVNDTESPIIYRDES 
MRTACSPDGLCSDGNGLELKCPFTSRDFMKFRLGGFEAIKSAYM 
AQVQYS MWVTRKNAW YFAN YD PRMKREGLH YWI ERDE KYM \AS 
FDEI\VP\EFIGKMDEVLSRDPM 


5734 


3 


968 


K 1 FSL-SLLVLLTTANNLFVLIPAYSKNRAYAIFFIVFTVI 
GSLFLMWLLTAHYSQFRGYLMKSLQTSLFRRRLGTRAAFEVLS 
SMVGEGGAFPQAVGVKPQNLLQVLQKVQLDSSHKQAMMEKVRSY 
GSVLLSAEEFQKLFNELDRSWKEHPPRPEYQSPFLQSAQFLFG 
HYYFDYLGNLIALANLVSICVFLVLDADVLPAERDDFILGILNC 
VFIVYYLLEMLLKVFALGLRGYLSYPSNVFDGLiLTWLLVLEIS 
TLWCTDCHTQAGGRRWW/SLLSLWDMTRMLWMLIVFRFLRIIP 
SMKPMAWASTVLGL 


5735 


2 


540 


FFTPCVARAFNFPDQATVKKAAYSLPRVGGGTSCGLPQARRISL 
ATPRQLYK/ S SNMTQRWQRRE ISNFE YLMFLNTI AGRTYNDLNQ 
YPVFPWVLTOYESEELDLTLPGNFRDLSKPIGALNPKRAVFYAE 
RYETWEDDQ3PPYHYNTHYSTATSTLSWLVRIVSIFIELACLWY 
LKILT 


5736 




382 


GTRPSTKKSGYSPQQVAVIHCKGHQKENTAVAHSNQKADSAAQV 
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ID 
NO: 


Predicted 

nucleotide 

location 

corresponding 

lis 


i - lie ted end 
nucleotide 

corresponding 
to first 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Deucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan / Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








TARLSVTPPNLLPTVSFPQPDLPDNPVYSTTTEKLASDLRANKN 
QE3**ILPDSGIFIP*T*TSYLQSTTHLRRAKLPQLLRR 


573 7 


290 


1041 


KACLHLLSSFLTSNFLFNPLLPDSLYSVEAR3QRAWLGPCRRKR 
LQTLMRLAAGFQYSSHKDPSLSAKEKHTDYHNEARGPWPGWVG* 
RTADGSCGRGPDGAHHPGPKSSSWRASRLLPGLGGSHHLDAYVG 
RD1ECGTPAPLQLEIPPQPRGHPAPIPTGQAGPRDSGPGASP*V 
3TRPLTDGRR * PGVRPVGVJTPAHPAGTLRPRGAVEPSVSACGKW 
APSPTSQGCCEGRCDAVPKHRAWRTPLCSQ 


5738 




4S0 


DT_SLNCTLPETLPMTPSF*LSFL*FPGLARAKSIPTKTYSNEV 
VTLWYRPPDI LLGSTDYS TQ I DMW * GQVEVWQGPCGKGGGLVTT 
ATQPAAFLFTVPSLPRGVGCIFYEMATGF.PLFPGSTVEEQLHFI 
FRILSEEAWALCAVETHR 




1 


1222 


SFQRRGIRW1SP/HTLHPHPRAVWAGIGRGHGS*ALLGRARAPALC 
FPTLLEFLESLEPDLPALRAMGLHLWAAGPGTHPAGISDLLAEV 
SAEVD3PVPGYLSSPQSITDTCLYIFTSGTTGLPKAARISHLKI 
LQCQGFYQLCGVHQEDVIYLALPLYHMSGELLGIVGCMGIGATV 
VLKSKFSAGQFWEDCQQHRVTVFQYIGELCRYLVNQPPSKAERG 
HKVRLAVGSGLRPDTWERFVRRFGPLQVLETYGLTEGNVATINY 
TGQRGAVGRASWLYKHIFPFSLIRYDVTTGEPIRDPQGHCMATS 
PGEPGLLVAPVSQQS PFLGYAGGPELAQGKLLKDVFR PGDVFFN 
TRDLLVCDDQGFLRFHDRTGDPFRWKGENVATTEVAEVFEALDF 
LQEVNVYGVTV 


5740 


2S5 


231 


-TLCLESKTDLREKASHVSAQLQGEVRGLAGAI,WM*A 
I VY ERVYN *n; S RMVHAT .EQRRHPAGLS S SMALQLNP CLGMLMA 

_L 1 L ) 1 1 


5741 


1 


65 0 


1 LI 1 L 1TLPI l GA1PASGD 

yvarpgdkvaarvkavdgdeqw1laewsyshatnkyevddide 
egb:erhtlsrrrviplpqwkanpetdpealfqkeqlvlalypqt 
tcfyralihappqrpgddysvlfedtsyadgyspplnvaqryw 

ACKEPKKK*CRLADSPSPNDTGQDSRGRAGIKHIPPLKKK 






362 


TQSVKEILKRMPNVNLTDKDGNTALMIASKEGHTEIVQDLLDAG 
TYVNIPDRSGDTVLIGAVRGGHVEIVRALLQKYADIDIRGQDKfK 
TALYWAVEKGNATMVRDI LQCNPDTE I CTKDG 


5743 


2 


415 


3KTPEGIDAIEEIEIDLEETEREISPQENGLEEVKPLGEMQTDL 
KATGRE I SPREKTPEVIDATEE IDKDLEETGRRE I S PEENGPEE 
VKPVDEMETDLKTTGREGSSREKTREVIDAAEVIETDLEETERE 
ISPQE 






703 


TRRTTTTSPTTTRQMTTTPAALPTTWTTPDLTTGTPLQMTTIA 
VFTTANTCLSLTPSTLPEEATGLLTPEPSKEGPILTAESETVLP 
SDSWSSAESTSADTVLLTSKESKVWDLPSTSHVSMWKTSDSVSS 
PQPGASDTAVPEQNKTTKTGQI4DGIPMSMECNEMPISQLLMIIAP 
SLGFVLFALFVAFLLRGKLMETYCSQKHTRLDYIGDStCNVLNDV 
QHGREDEDGLFTL 


5745 


1400 


599 


GKSRFWLMKHSKKTYDSFQDELEDYIKVQKARGLEPKTCFRKM 
KGDYLE^CGYKGEVNSRPTYRMFDQRLPSHTIQTYPRSCNIPQT 
VENRLPQWLPAHDSRLRLDSLSYCQFTRDCFSEKPVPLNFNQQE 
YICGSHGVEHRVYKHFSSDNSTSTHQASHKQIHQKRKRHPEEGR 
EKSEEERSKHKRKKSCEEIDLDKHKSIQRKKTEVEIETVHVSTE 
KLKNRKEKKSRDWSKKEERKRTKKKKEQGQERTEEEMLWDQSI 
LGF 


5746 


3 


821 


" - F LT ? 3 SPAFDGELDLQRYSNGPAVSAWSLGMGAVSWSES 
RAGERRFPCPVCGKRFRFNSILALHLRTHQPERPRSPAARLLLE 
LEERALLREARLGRARSSGGMQATPATEGLARPQAPSSSAFRCP 
YCKGKFRTSAERERHLHILHRPWKCGLCSFGSSQEEELLffflSLT 
AHGAPERPLAATSAAPPPQPQPQPPPQPEPRSVPQPEPEPQPER 
EATPTPAPAAPEEPPAPPEFRCQVCGQSFTQS'tfFLKGHMRKHKA 
SFDHACPV 


574 7 


2 


1328 


DRHVETLCIHFLGPSTGSTAKTGGRNWLKTGNCLYGNTCRFVHG 
PS PRGKGYSSNYRRS PERPTGDLRERIKNKRQDVDTEPQKRNTE 
ESSSPVRKESSRGRHREKEDIKITKERTPESEEENVEWETNRDD 



366 



WO 01/53312 



PCT/US00/34263 



SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 


Predicted end 

co r re sponding 
to first 

residue of 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=I,ysine, 
L=Leucine, M=Methionlne, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion! 








SD1TGDINYDYVHELSLEMKRQKIQRELMKLEQENMEKREEIIIK 
KEVSPEWRSKLSPSPSLRKSSKSPKRKSSPKSSSASRKDRKTS 
AVSSPLLDQQRNSKTNQSKKKGPRTPSPPPPIPEDIALGKKYKE 
KYKVKDRIEEKTRDGKDRGRDFERQR3KRDKPRSTSPAGQHHSP 
ISSRHHSSSSQSGSSIQRHSPSPRRKRTFSPSYQRTLTPPLRRS 
ASPYPSHSLSSPQRKQSPPRHRSPMRSKGRHDHERTSQSHDRRH 
ERREDTRGKRDREKDSREEREYEODQSSSRDHRDDREPRDGRDR 
RE 


5748 


934 


473 


SEGPQVFYKGLAPTLIAIFPYAGLQPSCYSSLKHLYKWAIPAEG 
KKNESTLQNLLCGSGAGVISKTLTYPLDLFKKRLQVGGFEHARAA 
FGQVRRYKGLMDCAKQVLQKEGALGFFKGLSPSLLKAALSTGFM 
FFSYEFFCNVFHCMNRTASQR 


5749 


552 


1 


GFPVDPRVRGSTL3LAERPKGMIRSGSFRDPTDDVHGSVLSLAS 
SASSTYSSAEERMQSEQIRKLRRELESSQEKVATLTSQLSANAN 
LVAAFEQSLVNMTSRLRHIAETAEEKDTELLDLRETIDFLKKKN 
SEAQAVIQGALNA3ETTPKELRIKRQKSSDSISSLNSITSHSSI 
GSSKDADA 


5/50 


22 


866 


IFISICLWl LPKDCIE I 1 GRYLAIQF 
IILEWAYVFLYYYEYRKAKDQLDIAKDISQLQIDLTGALGKRTRF 
QENYVAQLILDVRREGDVLSNCEFTPAPTPQEHLTKNLELNDDT 
ILNDIKLADCEQFQMPDLCAEEIAIILGICTNFQKNNPVHTLTE 
VELLAFTSCLLSQPKFWAIQTSALILRTKLEKGSTRRVERAMRQ 
TQALADQFEDKTTSVLERLKIFYCCQVPPHWAIQRQLASLLFEL 
GCTSSALQIFEKLEMWE 


5751 


3 


751 


scgsalrawrcgaaalatfpapalpglmyralyafrsaepnala 
faagetflvlerssahwwlaararsgetgyvepaylrrlqgleq 
dvlqa;draieavhntamrdggkysleqrgvlqklihhrketls 
rrgpsassvavmtsstsdhhldaaaarqpngvcragferqhslp 
ssehlgadgglfqiplpssqippqprraapttppppvkrrdrea 
lmasgsgghntmpsggnsvssgssvssci 


5752 


3 


471 


GPVCGVGLSVAWAGPWRGPVHSVGGGGRAALHGAELPCLSGAAT 
VEREMELRHKNEMLRVETEARARAKAERENADIIREQIRLKASE 
HRQTVLESIRTAGTLF3EGFRAFVTDRDKVTATVNIFIKQGWQV 
AERQHVGA3 WS PRSCP CRLCTAL 


5753 


34 


483 


DDSXAI PGG VCAP FGAVRN I YTPRTGHR I R KLDQ I Q S GGN Y VAG 
GQEAFKKLNYLDIGEIKKRPMEWNTEVKPVIHSRINVSARFRK 
PLQEPCTIFLIANGDLINPASRLLIPRKTLNQWDHVLQMVTEKI 
TLRSGAVHRLYTLEGRLV 


5754 


14 


331 


TLVHWEFAGEHAEAIASREQEVLQGWKELLSACEDARLHVSST 
ADALRFHSQVRDLLSWMDGIASQIGAADKPRCPSSLLGLPASPW 
WPTPATPSPLTAPFSME 


5755 


3 


888 


LGDQFYKEAIEHCRSYNSRLCAERSVRLPFLDSQTGVAQNNCYI 
S\fMEKRHRGPGLAPGQLYTYPARCWRKKRRLHPPEDPKLRLLEIK 
PEVELPLKKDGFTSESTTLEALLRGEGVEKKVDAREEESIQEIQ 
RVLENDENVEEGNEEEDLEEDIPKRKNRTRGRARGSAGGRRRHD 
AASQEDHDKPYVCDICGKRYKNRPGLSYHYAHTHLASEEGDEAQ 
-i ETRS E PI JHRNENHRPQKGPDGTVIPNNYCDFCLGGSNMNKKS 
GRPEELVSCADCGRSAHLGGEGRKEKEAAA 


575S 


3 


621 


SSKLQA^FAHPLYNVPEEPPLLGAEDSLLASQEALRYYRRKVAR 
Wt-IRRHKKYRSQMNLTSLDPPLQLRLEASWVQFHLGIWRHGLYSR 
SSPWSKLLQDMRHFPTISADYSQDEKALLGACDCTQIVKPSGV 
HLKLVLRFSDFGKAMFKPMRQQRD3ETPVDFFYFIDFQRHNAEI 
AAFH1ERILDFRRVPPTVGRIVNVTKEIL 


5757 


3 


473 


YKD.^lLLLPDNHRQWFENGTLKIjTDVQKGMDEGEYLCSVLIQPQ 
LSISQSVHVAVKVPPLIQPFEFPPASIGQLLYIPCWSSGDMPI 
RITWRKDGQVIISGSGVTIESKEFMSSLQISSVSLKHNGNYTCI 
ASNAAATVSRERQLIVRVPPRFW 




1 


474 


FRRC-AGAERGSHREGERGAAGMGEFKVHRVRFFNYVPSGIRCVA 
YNNQSNRLAVSRTDGTVEIYNLSANYFQEKFFPGHESRATSALC 
WAEGQRLFSAGLNC-EIMEYDLQALKIKYAMDAFGGPIWSMAASP | 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 

amino acid 

sequence 


Predicted end 

corresponding 
to first 
amino acid 
residue of 
amino acid 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I = Isoleucine, K=Lysir_e, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








SGSQLLVGCEDGSVKLFQITPDKI PV 


5759 


2 


1240 


GNA.a.FAGQGVi/YETFHMSDLPSYTTNGTTOV/VNNQiGFTTDPR 
MARSSPYPTDVARVWAPIFHVNADDPEAVIYVCSVAAEWRNTF 
NKDVGADLVCYRRRGHNEMDEPMFTQPLMYKQIHRQVPVLKKYA 
D KL I AEGTVTLQEFEEE I AKYDR I CEEAYGRSKDKKI LHIKHWL 
DSPWPGFFNVDGEPKSMTCPATGIPEDMLTHrGSVASSVPLEDF 
KIHTGLSRILRGRADMTKMRTVDWALAEYMAFGSLLKEGIHVRL 
NGQDVERGTFSHRHHVLHDQEVDRRTGVPMNHLWPDQAPYTVCN 
S S LSE YGVLG FELGYAMA3 PNALVLWEAQ FGDFHNTAQ C I IDQF 
ISTGQAKWVRHNGIVLLLPHGMEGMGPEHSSARPERFLQMSNDD 
SDAYPAFTKDFEVSQL 


576G 




1221 


vrditsdslslswtvpegqfdhflvqfkngdgqpkavrvpghed 
gvtisglepdhkykmnlygfhggqrvgpvsavgltapgkdeema 
pastepptpeppikprleeltvtdatpdslslswtvpegqfdhf 
lvqykngdgqpkatrvpghedrvtisglepdhkykmnlygfhgg 
qrvgpvsaigvtaaeeetptptepsmeapeppeepllgeltvtg 
sspdslslswtvp2grfdsftvqykdrdgrpqwrvggeesevt 
vgglepgrkykmhlyglhegrrvgpvstvgvtapqedvdetpsp 
tepgteapeppeepllgeltvtgcspd3lslswtvpqgrfdsft 
vqykdrdgrpqavrvggqeskvtvrglepgrkykmhlyglhegr 
rlgpvsaigvt 








SCDMAEAAALVWIRGPGFGCKAVkLTiKGRCTVRDFIHRHCQDQN 
VPVENFFVKCNGALINTSDTVQHGAVYSLEPRLCGGKGGFGSML 
RALGAQIEKTTNREACRDLSGRRLRDVNIIEKAMAEWVKQQAERE 
AEKEQKRLERLQRKLVEPraCFTSPDYQQQCHEMAERLEDSVLK 
GMCAASSKMVSAEISENRKRQWPTKSQTDRGASAGKRRCFWLGM 
EGLETAEGSNSESSDDDSEEAPSTSGMGFHAPKIGSN3VEMAAK 
FPSGSQRARVWTDHGSPEQLQIPVTDSGRHILEDSCAELGESK 
EHMESRMVTETEETQEKKAESXEPIEEEPTGAGLNKDKETEERT 
DGERVAEVAPEERENVAVAKLQESQPGNAVIDKETIDLLAFTSV 
AELELLGLEKLKCELMALGLKCGGTLQ 


5762 


2 


344 


GSTGQTPLHSQGGGGGSGGGRRRTPRGMPKEKYEPPDPRRMYTI 
MSSEEAANGKKSHWAELE I SGKVRSLSASLWSLTHLTALHLSDH 
SLSRIPSDIAKLHNLVYLDLSSNKIR 


57S3 




429 


LDKDTGLIMLIARliDYELIQRFTLTIIARDGGGEETTGRVRIW 
LDVNDNVPTFQKDAYVGALRENEPSVTQLVRLRATDEDSPPNNQ 
ITYSIVSASAFGSYFDISLYEGYGVISVSRPLDYEQISNGLIYL 
TVMAMDAGN 


5764 






VCARACGEMRQLLRPIDRQRYDENEDLSDVEEIVSVRGF3LEEK 
LRS3LYQGDFVHAMEGKDFNYEYVQREALRVPLIFREKDGLGIK 
MPDPDFTVRDVKLLVGSRRLVDVMDVNTQKGTEMSMSQFVRYYE 
TPEAQRDKL 


5765 


3 


825 


QKILRLNNSHQPPTSSSNSKDCGGPASSGAGATAALADGLKFAS 
V- £ J ? QGNS HKE TS KS KVKRS KTS KDANKS L ? SAALYG I PE I S 
STGKRQEVQGRPGEATGMNSALGQSVSSGGSGNPNSNSTSTSTS 
AATAGAGSCGKSKEEKPGKSQSSRGAKRDKDAGKSRKDKHDLLQ 
' ^C-SQAPSGGHLYGFGAKSNGGGAS PFHCGGTGSGSVAAA 
GEVSKSAPDSGLMGNSMLVKKEEEEEESHRRIKKLKTEKVDPLF 
TVPAPPPHV 


' 5766 


1608 


663 


EG^FSVDPASSQAT/ELSDVTLIEGVGNEVMWAGVWLILALVL 
-P.WLSTYVADSGSNQLLGAIVEAGDTSVLHLGHVDHLVAGQGNPE 
PTELPHPSEGNDEKAEEAGEGRGDSTGEAGAGGGVEPELEHLLD 
IQGLPKRQAGAGSSSPEAPLRSEDSTCIiPPSPGLITVRLKFLKfD 
TEELAVARPEDTVGALKSKYFPGQESQMKLIYQGRLLQDPARTL 
RSLIIITDNCVIHCEJRSPPGSAVPGPSASLAPSATEPPSLGVWG 
SLMVPVFWLLGWWYFRINYRQFFTAPATVSLVGVTVFFSFT.V 
FGMYGR 


57S7 


2 


892 


NFRATPRP -'TRPELRTGTEVILWYLDWRALMKRKRMKANIKLVG 
SGFPLP3SDLDDSLTEEIDEKIGFRNDANFDWQNVADFRDAGGS 
LTEVKV3EEERDPQSPEFEIEEEEEMLSSVIPDSRRENELPDFP 
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ID 
NO: 


Predicted 
beginning 

corresponding 
to first 
amino acid 

sequence 


Predicted end 
nucleotide 
location 
corresponding 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








H _ DE FFTLNS TPSRSAYDEPHLLVKTIEKQKLELEKRRLDIEAEE 
LQVEKERLQIEKERLRHLDMEHERLQLEKERLQIEREKLRLQIV 
NSEKPSLENELGQGEKSMLQPQDIETEKLKLERERLQLEKDRLQ 
FLKFESEKLQ I EKERLQVEKDRLR I QKEGHLQ 


57S8 


3 


476 


SSRSRLSVSVSPPPPGIVELGPPFAWEFCSRLGSAVTSQRAGPA 
AAMVAKDYPFYLTVKRANCSLELPPASGPAKDAEEPSNKRVKPL 
SRVTSLANLIPPVKATPLKRFSQTLQRSISFRSESRPDILAPRP 
WS RNAAPS S TKRRDS KLWS ETFD VC 


57S9 


38 


567 


TKTKaCVKEKATDQSVKAFAEHCPELQYVGFMGCSVTSKGVIHL 
TKLRNLSSLDLRHITELDNETAMEIVKRCKNLISLNLCLWWIIN 
DRCVEVIAKEGQNLKELYLVSCKITDYALIAIGRYSMTIETVDV 
GWCKEITDQGATLIAQSSKSLRYLGLMRCDKVNEVTVEQLVQQY 
PHITFSTVLQDCKRTLERAYQMGWTPNMSAASS 


5770 




484 


DSRRYDVKTRKWSFLLEEHSKLIAKVRCLPQVQLDPLPTTLTLA 
FASQLKKTSLSLTPDVPEADLSEVDPKLVSNLMPFQRAGVNFAI 
AKGGRLLLADDMGLGKTIQAICIAAFYRKEWPLLVWPSSVRFT 
WEQAFLRWLPSLSPDCINWVTGKDRLTA 


5771 


168 


741 


CLLPSACLRARSNREASEGPSSRACSNS.SQr)' KKaCTYSGTSTPS 
FKGSHCSGSDHSSLGLEQLQDYMVTLRSKLGPLEIQQFAMLLRE 
YRIiGLiPIQnYCTGLLKLYGDRRKFLLLGMRP F : PDQD IGYFEGF 
LEGVGIREGGILTDSFGRIKRSMSSTSASAVRSYDGAAQRPEAQ 
AFHRLLADITHDIE 


5772 




383 


E-NLALVSFSIIFQIKAEDDQPLE J 71,1 31 " VITiLTQDNG 
ILTFSNLVTCSAIYHLPVFPEREPGCSMRDLRVA 


5773 


2 


72 3 


PRVRSKHNFCFMEMNTRLQVEHFVir-lU 1 rill, iWQLRIAAGE 
KIPLSQEEITLQGHAFEAR1YAEDPSNNFWPVAGPLVHLSTPRA 
DPSTRIETGVRQGDEVSVHYDPMIAKLWWAADRQAALTKLRYS 
LRQYNIVGLHTNIDFLLNLSGHPEFEAGNVHTDFIPQHHKQLLL 
SRKAAAKESLCQAALGLILKEKAMTDTFTLQAHDQFSPFSSSSG 
RRLNISYTRNMTLKDGKNSK 


5774 


2 


592 


FVEEENIRWRCGGSELNFRRAVFSADSKYIFCVSGDFVKVYST 
VTEECVH I LHGHRNLVTG I QLNPMJHLQL YS CS LDGT I KLWD Y I 
DGILIKTFIVGCKLHALFTLAQAEDSVFV1VNKEKPDIFQLVSV 
KLPKSSSQEVEAKELSFVLDYINQSPKCIAFGNEGVYVAAVREF 
YLS VYFFKKETTS RVTLSS S 


5775 


3 


538 


SSGCCDPAAPSSLAEAATMPVSKCPKKSESLWKGWDRKAQRNGL " 

RSQVYAVNGDYYVGEWKDNVKHGKGTQVWKKKC-AIYEGDWKFGK 

RDGYGTLSLPDQQTGKCRRVYSGWWKGDKKSGYGIQFFGPKEYY 

EGDWCGSQRSGWGRMYYSNGDIYEGQWENDKPNGEGMLRLSQNP 

RP 


577S 


2 


484 


RLPQDCVCQNLSESLGTLCPSKGLLFVPPDIDRRTVELRLGGNF 
IIHISRQDFANMTGLVDLTLSRNTISHIQPFSFLDLESLRSLHL 
DSXRLPSLGEDTLRGLVNLQHLIVNKNQLGGIADEAFEDFLLTL 
EDLDLSYNNLHGPAVGLRGDAWVQPSTS 


5777 


2 


949 


GQDPEEGQDLFQPEREVDPSWGRGREPRLGKLRFQNDHLSVLKQ 
VXKLEQALKDGSAGLDPQLPGTCYSPHCPPDKAEAGSTLPENLG 
3G3GE E VS 2RVHPSDLEGREPTPELVEDRKGSCRRPWDRSLENV 
YRGSEGSPTKPFINPLPKPRRTFKHAGEGDKDGKPGIGFRKEKR 
NLPPLFSLPPPPLPSSPPPSSVNRRLWTGRQKSSADHRKSYEFE 
DLLQSSSESSRVDWYAQTKLGLTRTLSEENVYEDILDPPMKENP 
YEDIELHGRCLGKKCVLNFPASPTSSIPDTLTKQSLSKPAFFRQ 
NSERRNV 


5778 


1 


1210 


QRRQSVSRLLLPVFLLEPPAEPGLEPPPEEEGGEPAGVAEEPGS 
GGPCWLQLEEVPGPGPLGGGGPLRSPSSYSSDELSPGEPLTSPP 
WAPLGAPERPEHLLNRVLERLAGGATRDSAASDILLDDIVLTHS 
LFLPTEKFLQELHQYFVRAGGMEGPEGLGRKQACLAMLLHFLDT 
YQGLLQEEEGAGHIIKDLYLLIMKDESLYQGLREDTLRI.HQLVE 
TVELKIPEENQPPSKQVKPLFRHFRRIDSCLQTRVAFRGSDEIF 
CRVYMPDHSYVTIRSRLSASVQDILGSVTEKLQYSEEPAGREDS 
LILVAVSSSGEKVLLQPTEDCVFTALGINSHLFACTRDSYEALV 
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SEQ 


Predicted 
nucleotide 
corresponding 

residue of 
amino acid 
sequence 


Predicted end 
nucleotide 

corresponding 
to first 
amino acid 
residue of 
amino acid 


Amino ac:a segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=Histidine, I=~soleucine , K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








PLPEEIQVSPGDTEIHRVEPEDVANHLTAFHNELFRCVHELEFV 
DYVFHGE 


5779 


13 3 


1S71 


EAVQV^IKHSADVNARDKNWQTPLHVAAANKAVKCAEVTIPILS 
SVTJVSDRGGRTALHHAALNGHVEMVNLLLAKGANINAFDKKDRR 
ALHWAAYKGHLDWALLINHGAEVTCKDKKGYTPLHAAASNGQI 
K" WKHLLNLGVE I DE I NV YGNTALH I AC YNGQDAWNE L I DYGA 
NWQPNNNGF7PLHFAAASTHGALCLELLWNGADVNIQSKDGK 
S PLHMTAVHGRFTRSQTLI QNGGE I DCVDKDGNTPLHVAARYGH 
ELLINTLITSGADTAKCGIHSMFPLHLAALNAHSDCCRKLLSSG 
QKYSIVSLFSNEHVLSAGFEIDTPDKFGRTCLHAAAAGGNVECI 
KLLQSSGADFHKKDKCGRTPUJYAAANCEFHCIETLVTTGAKTVN 
ETDDWGRTALHYA^SDMDRNKTILGNAHDHSEELERARELKEK 
EATLCLEFLLQNDANF S IRDKEG YNS I HYAAAYGHRQCLELLLE 
RTNSGFEESDSGATKSPLHLAVSEMP 


5780 


154 




QFFRVITCLPFKGPDYRLYKSEPELTTVAEVDESNGEEKSEPVS 
EIETSWKGSHFPVGWPPRAKSPTPESSTIASYVTLRK7KKMM 
DLRTERPRSAVEQLCLAESTRPRMTVEECMERIRRHCOACLREK 
KKQLI'JViaASDQSPLQSPSKLRDNP 


5781 


19 


941 


1 PPMR: * i-i 1 iOERAYGGRGPGGAF 
PAPPVSGTCPPDLIYAPTPEKAEGGSQKNHQPPPGERAAHRDGE 
QAPCRAGPTRKVAVAPRPPSCP*GPE\PGEEPRR?LDRSPPLGQ 
VQPHFTSQDAKSAEDEAPSRHLGKHQPRSAQVGSRLDALQGPKT 
QHSIHTVTCKSPRQKEDRSPKPPQAPKIIPEEHGRQSXQAPPPLP 
VAPSRTCGGC*TWDPALLVSP/PQGDSTPELPAP\QQPTGGPSR 
CRQALPPQG*RQQPRQRPR/PTGASRSHPAKAKGCQGPPKIRNY 
NIMD 


5782 


5176 


1237 


DRSMMSMAADSYTDSYTDTYTEAYMVPPLPPEEPPTMPPLPPEE 
PPMTPPLPPEEPPEGPALPTEQSALTAENTWPTEVPSLPSEESV 
SQPEPPVSQSEISEPSAVPTDYSVSASDPSVLVSEAAVTVPEPP 
PEPESS I TLTPVESAVVAEEHEWPERPVTCKVSETPAMSAEPT 
VLASEPPVMSETAETFDSMRASGHVASEVSTSLLVPAVTTPVLA 
ESILEPPAI4AAPESSAMAVLESSAVTVLESSTVTVLESSTVTVL 
EPSWTVPEPPWAEPDYVTIPVPWSALEPSVPVLEPAVSVLQ 
PSMIVSEPSVSVQESTVTVSEPAVTVSEQTQVIPTEVAIESTPM 
ILESSIMSSHVMKGINLSSGDQNLAPEIGMQEIALHSGEEPHAE 
EHLKGDFYESEHGINIDLNIWWHLIAKEMEHNTVCAAGTSPVGE 
IGEEXILPTSETKQRTVLDTYPGVSEADAGETLSSTGPFALEPD 
ATG\TSKGIEFTTASTLSLWKYDVDLSLTTQDTEHDMLISTSP 
^ - / , X_EGPLPAKDIIILDLPSNINLVSSDTNEPLPVKRD\DQ 
TLAZUjI\SLKESSGGEKEVPPPS*REHLPDSGFSANIEDINEAD 
LVRPVSSPRTWNVLPSPRAGL\EGP\LIASDFGPVQNLYSSPW 
\SSMP\ERASGS\SSGEKGG\YEIFVKVKDTHEKSKKNKNRDKG 
EKEXKRDSSLRSRSKRSKSSEHKSRKLTSESRSRARKRSSKSKS 
HRS\QTRSRSRS/RDRRRRSSRSRSKSRGRRSVSKEKRKRSPKH 
R3KSRERKRKR3SSRDNRKTVRAR3RTPSRRSRSKTPSRRRRSR 
SVGRRRSFSISPSRRSRTPSRRSRTPSRRSRTPSRRSRTPSRRS 
P.TPSRRSRTPSRRRRSRSWRRRSFSISPVRLRRSRTPLRRRFS 
RSPIRRKRSRSSERGRSPKRLTDLDKAQLLEIAKANAAAMCAKA 
_ PIILKPAPPETIEEKVAKKSGGATIEELTEKCKQIAQSKE 
DDDVIVNKPHVSDEEEEEPPFYHHPFKTjSEPKPIFFNLNIAAAK 
PTPPKS2VTLTKEFPVSSGSQHRKKEADSVYGEWVPVEKNGEEN 
KBDDNVFS SNLPSEPVDI STAMS ERALAQKRLS ENAFDLEAMSM 
LNRAQERIDAWAQLNSIPGQFTGSTGVQVLTQEQLANTGAQAWI 
.'Kr',.rLRAA?VTGGMGAVLMRKMGWREGEGLGKNKEGNKEPILV 
DFKTDRKGLVAVGERAQKRSGNFSAAMKDLSGKHPVSALMEICN 
KRRWQPPEFLLVHDSGPDHRKHFLFRVLINGSAYQPNCMFFTiNR 
Y 


5783 


1693 


698 


1 TI ; SMFKTPSKLSEKKKSVLCSTPTINIPASPFM 
QKLGFGTGVNVYLMKRSPRGLSHSPWAVKKINPICNDHYRSVYQ 
KRLMDEAKILKSLHHPNIVGYRAFTEAMDGSLCLAMEYGGEKSL 
NDLIEE/PI*SQ/PKILFQQP/LILKVALNMARGLKYLHQEKKL 
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SEQ 
ID 


Predicted 

beginning 

nucleotide 

location 

corresponding 

amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 

sequence 


Amino acid segment containing signal peptide 
!A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Fhenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan / Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LHGDIKS SNWI KGDFETI KI CDVGVS LPLDENMTVTDPEAC Y I 
GTEPWKPKEAVEENGVITDKADIFAFGLTLWEMMTLSIPHINLS 
NDDDDEDKTFDESDFDDEAYYAALGTRPPINMEELDESYQKVIE 
LFSVCTNEDPKDRPSAAHIVEALETDV 


5784 


2669 


1388 


PRVRPRWTDHNYYISRIYGPSDSASRDLWVNIDQMEKDKVKIH" 

G I LSNTHRQAARVNLSFDFPFYGH FLREI TVATC-GF I YTGEWH 

RM'jTATQYIAPLMANFDPSVSRNSTVRYFDNGTALWQWDHVHL 

QDNYNLGS FTFQATLLMD3RI I FGYKE 1 PVLVTQ ISSTNHPVKV 

GLSDAFVWHRICQIPNVRRRTIYEYHRVELQMSKITNISAVEM 

TPLPTCLQFNRCGPCVSS2IGFNCSWCSKLQRCSSGFDRHRQDW 

VDSGCPEESK3KMCENTEPVET\?J J EPPQP*ERCPPSSGS*LPP 

E/DAVTSQFPTSLPTEDDTKIALHLKDNGASTDESAAEKKGGTL 

HAGLIVGILILVLIVATAILVTVYMYHHPTSAASIFFIERRPSR 

WPAMKFRRGSGHPAYAEVEPVGEKEGFIVSEQC 


5785 


2669 


1388 


PRVRPRVRTDHNYYISR1YGPSDSASRDLWVNIDQMEKDKVKIH'" 

G I LSNTHRQAARVNLS FDFPF YGHFLRE I TVATGGF I YTGEWH 

RMLTATQYIAPLMANFDPSVSRNSTVRYFDNGTALWQWDHVHL 

QDNYNLGS FTFQATLLMDGRI I FGYKEI PVLVTQ1SSTNHPVKV 

GLSDAFWVHRIQQIPNVRRRTIYEYHRVELQMSKITNISAVEM 

TPLPTCLQFNRCGPCVSSQIGFNCSWCSKLQRCSSGFDRHRQDW 

VDSGCPEESKEKMCENTEPVET\FLEPPQP*ERQPPSSGS*LPP 

E/DAVTSQFPTSLPTEDDTKIALHLKDNGASTDDSAAEKKGGTL 

HAGLIVGILILVLIVATAILVTVYMYHHPTSAASIFFIERRPSR 

WPAMKFRRGSGHPAYAEVEPVGEKEGFIVSEQC 


S78el 


2532 


1674 


lli o L.^ (J P PTPTRRRWPAPGRTSRGHRPQM * SGTP 
APRPPARSTVSPASPLPKPRAGRCGSRPRSACSTFRFC*SLN*M 
S*H*KRNLSQRSSSMSRRPLSCARPHR**RQGLTVAARLPTWAK 
SPPLACSFCQAAQKSQSLSSGRSTR*PERMSFRP\SPPGNPAIP 
SLAPSSRP/PKGRPQCTWIPSRWPAS?TAPPTTT*APTSSPGST 
GRSMMTCPTRWTATPWSARASSRPRNWPTP*WRP3GRLSTV*RA 
TGGSTATAPPKRFPRNWNPMMAE 


5787 


2 


1460 


MASAASVTSLADEWCP\ICQGTLKEAGSLSNCG/HKNFCRACL 
T\RYCEIP\GPD\LEESP\TCP\LCKEPFRP\GSFRPNWQLANV 
VENIERLQLVSTLGLGEEDVCQEHGEKIYFFCEDDEMQLCWCR 
EAGEHATHTMRFL3DAA\APYREQ IHKCLKCL I KEREE IQEIQS 
RENKKMQVLLTQVSTKRQQVISEFAHLRKFLEEQQSILLAQL3S 
QDGD ILRQRDE FDLLVAGE I CRFSAL IEELEEKHERPARELLTD 
IRSTLIRCETRKCRKPVAVSPELGQRIRDFPQQALPLQREMKMF 
LEX1CFELDYEPAHISLDPQTSHPKLLLSEDHQRAQFSYKWQNS 
PDNPQRFDRATCVLAHTGITGGRHTWWSIDLAHGGSCTVGWS 
EDVQRKGELRLRPEEGVWAVRLAWGFVSALGSFP\TRLTLKEQP 
RQVRVSLDYEVGWVTFTNAVTREPIYTFTASFTRKVIPFFGLWG 
RGSSFSLSS 


578B 


2 


SB60 


3HSVSGRSSAYGDATAEGHPAGPGSVSSSTGAISTTTGHQEGDG 
SEGEGEGETEGDVHTSNRLHMVRLMLLERLLQTLPQLRNVGGVR 
AIFYMQVILMLTTDLDGEDEKDKGALDNLLSQLIAELGMDKKDV 
SKKNERSALNEVHLWMRLLSVFMSRTKSGSKSSICESSSLISS 
ATAAALLSSGAVDYCLHVLKSLLEYWKSQQNDEEPVATSQLLKP 
HTTSSPPDMSPFFLRQYVKGHAADVFEAYTQLLTEMVLRLPYQI 
KKITDTNSRIPPPVFDHSWFYFLSEYLMIQQTPFVRRQVRKLLL 
FICESKEKYRQLRDLHTLDS\HVRGIKKLLEEQGIFLRASWTA 
SPQSALQYDTLISLMEHLKACAEIAAQRTINWQKFCIKDDSVLY 
FLLQVS FLVDEGVS PVLLQLLS CALCGS KVLRAIAAS SGSSSAS 
SSPAPVAASSGQATTQSKSSTKKSKKEEKEKEKDGETSGSQEDQ 
LCTALVNQLNKFADKETLIQFLRCFLLESNSSSVRWQAHCLTLH 
IYRNSSKSQQELLLDLMWSIWPELPAYGRKAAQFVDLLGYFSLK 
TPQTEKKLKEYSQKAVEILRTQNHILTNHPNSNIYNTLSGLVEF 
DGYYLESDPCLVCNNPEVPFCYIKLSSIKVDTRYTTTQQWKL1 
GSHTISKVTVKIGDLKRTKMVRTINLYYNNRTVQAIVELKNKPA 
RWHKAKKVQLTPGQTEVKIDLPLPIVASNLMIEFADFYENYQAS 
TETLQCPRCSASVPANPGVCGNCGENVYQCHKCRSINYDEKDPF 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
sequence 


Freaicted end 
nucleotide 

corresponding 
to first 
amino acid 
residue of 

sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T= Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LCNACGFCKYARFDFMLYAKPCCAVDPIENEEDRKKAVSNINTL 
LDKADRVYHQLMUHRPQLENLLCKVNEAAPEKPQDDSGTAGGIS 
STS ASVNRYI LQLAQEYCGDCKNS FDELSKI IQKVFASRKELLE 
YDLQQREAATKSSRTSVQPTFTASQYRALSVLGCGHTSSTKCYG 
CASAVTEHCITLLRALATNPALRHILVSQGLIRELFDYNLRRGA 
AAMREEVRQLMCLLTRDMPEATQQMNDLIIGSVSTALKGHWANP 
DIASSLQYEMLLLTDSISKEDSCWELRLRCALSLFLMAVNIKTP 
VWENITLMCLRILQKLIKPPAPTSKKNKDVPVEALTTVKPYCN 
EIHAQAQLWLXREPKASYDANKKCLPIRGIDGNGKAPSKSELRH 
LYLTEKYWRWKQFLSRRGKRTSPLDLKLGHNNWLRQVLFTPAT 
QAARQAACTIVEALATIP3RKQQVLDLLTSYLDELSIAGECAAE 
YLALYQKL I TS AHWKVYIAARGVLPYVGNL ITKEIARLLALEEA 
TLSTDLQQGYALKSLTGLLSSFVEVESIKRHFKSRLVGTVLNGY 
LCLRKLWQRTKLIDETQ3MLLEMLEDMTTGTESETKAFMAVCI 
ETAKRYNLDDYRTPVFIFERLCSIIYPEENEVTEFFVTLEKDPQ 
QEDFLQGRMPGNPYSSNEPGIGPLMRDIKNKICQDCDLVALLED 
DSGMEuLVNNKIISLDLPVAEVYKKVWCTTNEGEPMRIVYRMRG 
LLGDATEEFIESLDSTTDEEEDEEEVYKMAGVMAQCGGLECMLN 
RLAGIRDFKQGRHLLTVLLKLFSYCVKVKVNRQQLVKLEMNTLN 
VMLGTLNLALVAEQESKDSGGAAVAEQVLSIMEI\IQAEPNVEP 
LSEDKGNLLLTGDKDQLVMLLDQINSTFVRSNPSVLCGLLRIIP 
YLSFGEVEKMQILVERFKPYCNFDKYDEEHSGDDKVFL\DCFCK 
IAAGIK\NNSNGHQL \ KDL \ILQKGITQNALD\YMKKHIP/SAA 
RIWDADI\WKSFCLRPALPFILRLLRGLAIQHPGTQVLIGTDSI 
PNLHKLEQVS\SDEGIGTLA\ENL\LESLREHPD - i/NKKIDA\AR 
RETRAEKKRMAMAMRQKALGTLG\MTTNEKGQWD/TRTALLBA 
DWEELIEEP\GLTCCICREGYKFQPTKVLGIYTFTKRWLGGVW 
ENKPRETSRATSTVSHFNIVHYDC\HLA\AVSLARGREEWESAA 
LQNANTKCNGLLPVWGPHVPESAFATCLARHNTYLQECTGQRSP 
TYQLNIHDIKLLFLRFAMEQSFSADTGGGGRESNIHLIPYIIHT 
GLYVLNTTRATSR3EKNLQGFLEQPKEKWVESAFEVDGPYYFTV 
LALHILPPEQWRATRVE1LRRLLVTSQARAVAPGGATRLTDKAV 
KDYSAYRSSLLFWALVDLIYNMFKKVPTSNTEGGWSCSLAEYIR 
HNDMPIYEAADKALKTFQEEFMPVETFSEFLDVAGLLSEITDPE 
SFLKDLLMSVP 


5789 


1 


24 C 7 


LPLHAVEICTGRPGQPALKMPGKLRSDAGLESDTAMKKGETLRKQ 
TSEKEKKEKPKSDKTEEIAEEEETVFPKAKQVKKKAEPSEVDMN 
3PKSKKAKK\KEEPSQNDISPKTKSLRKKKEPIEKKWSSKTKK 
VTKNEEPSEEEIDAPKPKKMKKEKEMNGETREKSPKLKNGFPHP 
-?ECNPSE.AASEESNSEIEQEIPVEQKEG\AFSNFPISEETIKL 
LKGRGVTFLFFIQAKTFHKVYSGKDLIAQARTGTGKTFSFAIPL 
IEKLHG\ELQDRKRGRAPQVLVLAPTRELANQVSKDFSDITKKL 
SVACFYGGTPYGGQFERMRNGIDILVGTPGRIKDHIQNGKLDLT 
KLNKWLDEVDQMLDMGFADQVEEILSVAYKKDSEDNPQTLLFS 
ATCPHWVFNVAKKYMKSTYEQVDLIGKKTQKTAITVEHLAIKCH 
WTQRAAVIGDVIRVYSGHQGRTIIFCETKKEAQELSQNSA1KQD 
AQSLHGD I PQKQRE I TLKGFRNGS FGVLVATNVAARGLDIPE VD 
LVIQSSPPKDVESYIHRSGRTGRAGRTGVCICFYQHKEEYQLVQ 
VEQKAG I XFKRIGVPSATEI IKASSKDAIRLLDSVPPTAISHFK 
QSAEKLIEEKGAVEALAAALAHISGATSVDQRSLINSIIVGFVTM 
ILQC3IHKPNISYAWKELKEQLGEEIDSKVKGMVFLKGKLGVCF 
DVPTASVTEIQEKWHDSRRWQLSVATEQPELEGPREGYGGFRGQ 
REGSRGFRGQRDGNRRFRGQREGSRGPRGQRSGGGNKSNRSQNK 
GQXRSFSKAFG2 


5790 


3786 


1585 


ARRQRDPLQAI.RRRNQELBCQQVDSLLSESQLKEALEPNKRQHIY 
QRCIQLKQAIDENKNALQKLSKADESAPVANYNQRKEEEHTLLD 
KLTQQLQG1AVTISRENITEVGAPTEEEEESESEDSEDSGGEEE 
DAEEEEEEKEENESHKWSTGEEYIAVGDFTAQQVGDLTFKKGEI 
LLVIEKKPDGWWIAKDAKGNEGLVPRTYLEPYSEEEEGQESSEE 
GSE3DVEAVDETAEGAEVK\QRTDPHWSAVQKAISEAGIFCLVN 
HVSFCYLIVLMRNRMETVEDTKGSETGFRAWNVQSRGRIFLVSK 



372 



WO 01/53312 



PCT/US00/34263 



Predicted 
beginning 
nucleotide 

corresponding 



Predicted end 
nucleotide 
location 
corresponding 
first 



Amino acid segment containing signal peptide 
{A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F= Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
Ii=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T= Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 



PVLQQINTVDVLTTMGAIPAGFRPSTLSQLLEEGNQFRANYFLQ 
PELMPSQ1AFKDLMWDATEGTIR3RFSRISLILTLWSCKMIPLP 
GMSIQVLSRHVRLCLFDGNKVLSNIHTVRATWQPKKPKTWTFSP 
QVTRILPCLLDGDCFIRSNSASPDLGILFELGISYIRNSTGERG 
ELSCGWFLKLFDASGVPIPAKTYELFLNGGTPYEKGIEVDPSI 
SRRAHGSVFYQIMTMRRQPQLLVKLRSLNRRSRNVLSLLPETLI 
GNMCSIHLLIFYRQILGDVLLKDRMSLQSTDLISHPMLATFPML 
LEQPDVMDALRSSWAGQES\TLKRSEKR\PKEFLKVPRFLLVYH 
\GCVLPLL/HTPTRLPPFRWAEEETETARWKVITDFLKQNQENQ 
GALQALLSPDGVHEPFDLSEQTYDFLGEMRKNAV 



LRVAEFAGTSR/ IGAGLI3PLHRAPARDHGL1RGGAAPALSVSH ~ 
GN/GKQL/AMSSQGSDDEQIKRENIRSLTMSGHVGFESLPDQLV 
NRSIQQGFCFNILCVGETGIGKSTLIDTLFNTNFEDYESSHFCP 
NVKLKAQTYELQESNVQLKLTIVNTVGFGDQINKEESYQPIVDY 
IDAQFEAYLQEELKIKP.SLFTYHDSRIHVCLYFISPTGHELKTL 
DLLTMKNLDSKVYIIPVIAKftDTVSKTELQKFKIKLMSELVSNG 
VQIYQFPTDDDTIAKTOAWNGQLPFAWGSMDEVKVGNKMVKA 
RQYPWGWQVENENHCDFVKLREMLICTNMEDLREQTHTRHYEL 
YRRCKIiEEMGFrDVGPENKPVSVQETYHAKRHEFHGBRQRKEEE 
MKQMFVQKVKEKEAILKEAERELQAKFEKLKRLHQEERMKLEEK 
RRLLEEEIIAFSKKKATSEIFHSQSFLATGSKLRKDKDRKNSQF 
FVKQKVPEHRRSSSQANFIKKKLEVCFDFAVICFITSIFGEQPQ 
LLIFM3KYFQVQGQYISQSE 



AS - \m< RVFWYWHTCWVMYGIVYTRPCSGDASCIQPY - 

LARRPKLCL\RHSFTTTRSHLGAENNIDLVLNVEDFDVESKFER 
TVWVSVPKKTRNN3TLYAYIFLHHAGVLPWHDGKQVHLVSPLTT 
YMVPKPEEINLLTGESDTQQIEADKKPTSALDEPVSHWRPRLAL 
NVMADNFVFDGSSLPADVHRYMKMIQLGKTVHYLPILFIDQLSN 
RVKDLMVINRSTTELPLTVSYDKVSLGRLRFWIHMQEAVYSLQQ 
FGFSE KDADE VKG I F VDTNLYFLALT FF VAAFHLL FE FLAFKND 
ISFWKKKKSMIGMSTKAVLWRCFSTWIFLFLLDSQTSLLVLVP 
AGVGAAIELWPWKPCALKMTIFWRGLMPEFQFGTYSESERKTEEY 
DTQAMKYLSYLLYPLCVGGAVYSLLNIKYKSWYSWLINSFVNGV 
YAFGFLFMLPQLFVNYKLKSVAHLPWKAFTYKAFNTFIDDVFAF 
IITMPTSHRLACFRDDWFLVYLYQRWLYPVDKRRVNEFGESYE 
EKATRAPHTD 



aaaapspawwcgvfwywhtcwvmygivytrpcsgdasciqpy 
larrpklql\rhsftttrshlgaiinm;dlvlkvedfdveskfer 
tvnvsvpkktknngtt.yayiflhhagvlpwhdgkqvhlvspltt 
ymvpkpeeinlltgesdtqqieaekkptsaldepvshwrprlal 
nvmadnfvfdgsslpadvhrymkmiqlgktvhylpilfidqlsn 
rvkdlmvinrettelpltvsydkvslgrlrfwihmqdavyslqq 
7gfsekdadevkgifvdtnlyflaltffvaafhllfdflafknd 
isfwkkkksmigmstkavlwrcfstwiflflldeqtsllvlvp 
agvgaaielwkvkkalkmtifnrglmpefqfgtyseserkteey 
dtqamkylsyllyplcvggavysllnikykswyswliwsfvngv 
yafgflfmlpqlfvwyklksvahlpwkaftykafntfiddvfaf 
iitmptshrlacfrddwflvylyqrwlypvdkrrvnefgesye 
ekatraphtd 



MGPRLSWLLLLPAALLLHEEHSRAAAKGGCAGSGCGKCDCHGV 
KGQKGERGLPGLQGVIGFPGMQGPEGPQGPPGQKGDTGEPGLPG 
TKGTRGP PGASGYPGNPGLPG I PGQDGP PGPP3I PGCMGTKGER 
GPLGPPGLPGFAGNPGPPGLPGMKGDPGEILGHVPGMLLKGERG 
FPGIPGTPGPPGLPGLQGPVGPPGFTGPPGPPGPPGPPGEKGQM 
GLSFQGPKGDKGDQGVSGPPGVPGQAQVQEKGDFATKGEKGQKG 
EPGFQGMPGVGEKGEPGKPGPRGKPGKDGDKGEKGSPGFPGEPG 
YPGLIGRQGP\QGEKGEAGPPGPPGIVIGTGPLGEKGERGYPGT 



RGFPGTSLPGPSGRDGLPGPPGSPGPPGQPGYTNGIVECQPGPP 
GDQGPPGIPGQPGFIGEIGEKGQKGESCLICDIDGYRGPPGPQG I 
PPGEIGFPGQPGAKGDRGLPGRDGVAGVPGPQGTPGLIGQPGAK 
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ID 
NO: 


Predicted 
beginning 

corresponding 
to first 
amino acid 
residue of 
amino acid 


Predicted end 
nucleotide 

corresponding 
to first 

residue of 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cystaine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








GEPGEFl^DLRLKGDKGDPGFPGQPGMPGRAGSPGRDGHPGLPG 
PKGSPGSVGLKGERGPPGGVGFPGSRGDTGPPGPPGYGPAGPIG 
DKGQAGFPGGPGSPGLPGPKGEPGKIVPLPGPPGAEGLPGSPGF 
PGPQGDRGFPGTPGR\PGL\PGEKGAVG\QPGIGFPGPPGPK3V 
DGLPGDMGPPGTPGRPGFNGLPGNPGVQGQKGEPGVGLPGLKGL 
PGLPGIPGTPGEKGSIGVPGVPGEHGAIGPPGLQGIRGEPGPPG 
LPGSVGSPGVPGTGPPGARGPPGGQGPPGLSGPPGIKGEKGFPG 
FPGLDMPGPKGDKGAQGLPGITGQSGLPGLPGQQGAPGIPGFPG 
3KGEMGVMGTPGQPGSPGPWGAPGLPGEKGD\HGFPGSSGPRGD 
PGLKGDKGDVGLPGKFGSMDKVYMGSMKGQKGDQGEKGQIGPIG 
EKGSRGDPGTPGVPGKDGQAGQPGQPGPKGDPGISGTPGAPGLP 
GPKGSVGGMGLPGTPGEKGVPGIPGPQGSPGLPGDKGAKGEKGQ 
AGFPGIGIPGLRGEKGDQGIAGFPGSPGEKGEKGSIGIPGMPGS 
PGLKGSPGSVGYPGSPGLPGEKGDKGLPGLDGIPGVKGEAGLPG 
TPGPTGPA3QKGEPGSDGIPGSAC-EKGEPGLPGRGFP3FPGAKG 
DKGSKGEVGFPGLAGSPG1PGSKGEQGFM3PPGPQGQPGLPGSP 
GHATEGPK3DRGPQGQPGLPGLPGPMGPP3LPGIDGVKGDKGNP 
GWPGAPGVPGPKGDPGFQGMPGIGGSPGITGSKGDMGPPGVPGF 
QGPKGLPGLQG IKGDQGDQGVPGAKGLPGPPGPPGP YD I IKGEP 
GLPGPEGPPGLKGLQGLPGPKGQQGVTGLVGIPGPPGIPGFDGA 
PGQKCEMGPAGPTGPRGFPGPPGPDGLPGSMGPPGTPSVDHGFL 
VTRHSQT IDDPQCPSGTKI L YHG YSLLYVQGNERAHGQDLGTAG 
SCLRKFSTMPFLFCNINNVCNFASRNDYSYWLSTPEPMPMSMAP 
ITGENIRPFISRCAVCEAPAP^/MAVHSQTIQIPPCPSGWSSLWI 
GYSFVMHTSAGAEGSGBALASPGSCLEEFRSAPFIECIIGRGTCRT 
YYANAYSFWLATIERSEMFKKPTPSTLKAGELRTHVSRCQVCMR 
RT 


5795 


1192 


61 


STRSPTVEYISAHPHILFMLLKGYEAPQIALRCGIMLRECIRHE 
PLAKIILFSNQFRDFFKYVELSTFDIASDAFATFKDLLTRHKVL 
VADFLEQNYDTIFEDYEKLLQSENYVTKRQSLKLLGELILDRHN 
FAI MTKY I S KPENL KLMMNLLRDKS PK I Q FEAFHVFKVFVAS PH 
KTQPIVEILLKNQPKLIEFLSSFQKERTDDEQFADEKNYLIKQI 
RDLKKTAP*RALRDSKR 


5796 


2 


1078 


GRVGWELWCMYISPPKDWWDAGDP3LPIRTPAMIGCSFWNRKF 
FG3IGLLDPGMDVYGGENIELGIKVWLCGGSMEVLPCSRVAHIE 
RKKXPYNSNIGFYTKRNALRVAEWMDDYKSHVYIAWNLPLEKP 
GIDIGDVSERRALRKSLKCKNFQWYLDHVYPEMRRYNNTVAYGE 
LRNNKAKDVGLDQGPLENHTAILYPCHGWGPQLARYTKEGFLHL 
GALGTTTLLPDTRCLVDNSKSRLPQLLDCDKVKSSLYKRWNFIQ 
^~GAIMNKGTGRCLEVENRGIAGIDLILRSCTGQRWTIKISrSIK*R 
EGAGALEPGPQDMAAPPNIWTSCPGGETARGRQVLDGPPRASPG 
QHRDPG 


5797 


2 


891 


PRVRQKTLVDVTLENSNIKDQ1RNLQQTYEASMDKLREKQRQLE 
VAQVENQLLKMKVESSQEANAEVMREMTKKLYSQYEEKLQEEQR 
KHSAEKEALLEETNSFLKA1EEANKKMQAAEISLEEKDQRIGEL 
ERLZERMEKERHQLQLQLLEHETEMSGELTDSDKERYQQLEEAS 
ASLRERIRHLNDMVHCQQKKVKQMVEEIESLKKKLQQKQLLILQ 
LLEKIS FLEGENNELQSRLDYLTETQAKTEVETREIGVGCDLLP 
SQTGRTRE I VMPSRNYTPYTRVLELTMKKTLT 


5798 


644 


115 


"KQPYYEECARLSKIHLEKYPNYKYKPRPKR 
TCIVDGKKLRIGEYKQLMRSRRQEMRQFFTVGQQPQIPITTGTG 
WYPGAITKATTTPSPQMTSDCSSTSASPEPSLPVIQSTYGMKT 
DGGSLAGNEMINGEDEMEMYDDYEDDPXSDYSSENEAPEAVSAN 


5799 


2679 


1435 


LLS tt I XFINLFPETKATIQGVLRAGSQLRNADVELQQRAVEYL 
TLS S VAS TDVLATVLEEMPP FPERES S I LAKLKRKKGPGAGSAL- 
DDGRRDPSSNDING3MEPTPSTVSTPSPSADLLGLRAAPPPAAP 
PASAGAGNLLVDVFDGPAAQPSLGPTPEEAFLSPGPEDIGPPIP 
EADELLNKFVCKNNGVLFEWQLLQIGVKSEFRQNLGRMYLFYGN 
K.rsVCFQNFSPTWHPGDLQTQLAVQTKRVAAQVDGGAQVQQVL 
EC! RDFLTPPLL3VRFRYGGAPQALTLKLPVTINKFFQPTEM 
A^QEFFQRWKQLSLPQQEAQKIFKANHPMDAEVTKAKLLGFGSA 
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ID 
NO: 


Predicted 
beginning 
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corresponding 

amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenyl alanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Ly3ine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=0nknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LLDNVDPNPENFVGAGIIQTKALQVGCLLRLEPNAQAQMYRLTL 
RTSKEPVSRHLCELLAQQF 


5800 


2679 


1435 


i 3 "iffinlfpetkatiqgvlragsqlrnadvelqqraveyl 
tlssvastdvijatvleemppfperessilaklkrkkgpgagsal 
ddgrrdpssndinggmeptpstvstpspsadllglraapppaap 
pasagagnllvdvfdgpaaqpslgptpeeaflspgpedigppip 
eadellnkfvcknngvlfenqllqigvksefrqnlgrmylfygn 
ktsvqfqnfsptwhpgdlqtqlavqtkrvaaqvdggaqvqqvl 
nieclrdfltppllsvrfryggapqaltlklpvtinkffqptem 
aaqdffqrwkqlslpqqeaqkifkanhpmdaevtkakllgfgsa 
lldnvdpnpenfvgagiiqtkalqvgcllrlepnaqaqmyrltl 
rtskepvsrhlcellaqqf 


5801 


3 


1413 


FPRLYHLIPDGEITS-KINRVDPSESLSIRLVGGSETPLVHIII 
QHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQP 
CQVLWLTVMREQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEE 
QLGIKLVRKVDEPGVFIFNVLDGGVAYRHGQLEENDRVLAINGH 
DLRYGSPESAAHLIQASERRVHLWSRQVRQRSPDIFQEAGWNS 
NGSWSPGPGERSNTPKPLHPTITCHEKVVNIQKDPGESLGMTVA 
GGASHREWDLP I YVIS VE PGGV I SRD3R I KTGD 1 LLN VDGVELT 
EVSRSEAVALLKRTSSSTVTiKAJjF.VKEYEPQEDCSSPAALDSNH 
NMAPPSDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVG 
GYEEYNGNKPFFIKSIVEGTPAYNDGRIRCGDILLAVNGRSTSG 
MIHACLARLLKELKGRITLTIVSWPGTFL 


5802 


3 


290 


CFSLYQIMERIMDLPTLLRHAFREMFSVGGLFWMFRIRIILCLM 
GAFFYLISPLDFVPEALFGILGFLDDFFVIFLLLIYISIMYREV 
ITQRLTR 


5803 


2234 


1299 


EAQFGTT^F REE [EI ' IGRQRFKVLELRTQSD 
GIQQAKVQILPECVLPSTMSAVQLESLNKCQIFPSKPVSREDQC 
SYKWWQKYQKRKFHCANLTSWPRWLYSLYDAETLMDRIKKQLRE 
WDENLKDDSLPSNPIDFSYRVAACLPIDDVLRIQLLKIGSAIQR 
LRCELDIMNKCTSLCCKQCQETEITTKNEIFSLSLCC-PMAAYVN 
PHGYVHETLTVYKACNLNLIGRPSTEHSWFPGYAWTVAQCKICA 
SHIGWKFTATKKDMSPQKFWGLTRSALLETIPDTEDEISPDKVI 


5804 




1707 


EMEKQRQEEQRKRTEEERKRRIEQDMLEKRKIQR3LAKRAEQIE 
DINNTGTESASEEGDDSLLITWPVKSYKTSGKMKKNFEDLEKE 
REEKERIKYEEDKRIRYEEQRPSLKEAKCLSLVMDDEIESEAKK 
ESLSPGKLKLTFEELERQRQENRKKQAEEEARKRLEEEKRAFEE 
ARRQMVNEDEENQDTAKIFKGYRPGKLKLSFEEMSRQRREDEKR 
KAEEEARRRIEEEKKAFAEARRNMWDDDSPEMYXTISQEFLTP 
GKLEINFEELLKQKMEEEKRRTEEERXHKLEMEKQEFEQLRQEM 
GEEEEENETFGLSREYEELIKLKRSGSIQAKNLKSKFEKIGQLS 
EKEIQKKIEEERARRRAIDLEIKEREAEWFHEEDDVDVRPARKS 
EAPFTHKVNMKARFEQMAKAREEEEQRRIEEQKLLRMQFEQREI 
DAALQKKREEEEEEEGSIMNGSTAEDEEQTRSGAPWFKKPLKNT 
SWDS E PVRFTVKVTGE PKPEITWWFEGEILQDGEDYQ YI ERGE 
TYCLYLPETFPEDGGEYMCKAVNNKGSAASTCILTIESKN 


5805 


3 


776 


YISDTLGQVYKSKIRWWIEENGGNGNISVDDLIALLDLAEHASS 
AFKESQQQSEDREYEVKERLYPKSKRRYDTYNIAGYQGEIEVGL 
YTIQILQLIPFFDNKKELSKRYMVNFVSGSSDIPGDPNNEYKLA 
LKNYI P YLTKLKFSLKKS FDFFDE YFVLLKPRNNI KQNEEAKTR 
RKVAGYFKKYTOIFCLLEESQNNTGLGSKFSEPLQVERCRRNLV 
ALXADKFSGLLEYLIKSQEDAISTMKCIVNEYTFLLK 


5806 


1257 


877 


AVFTFHNHGRTANLYSLHSWLGITTVFLFACQRFLGFAVFLLPW 
ASMWLRSLLKPIHVFFGAAILSLSIASVISGINEKLFFSLKNTT 
RPYHSLPSEAVFAXSTGMLWAFGLLVLYILLASSWKRP 


5807 


22G7 


1302 


RFEKKTFRRPMAVDICPACLGLYCGKTLLFKNGSTEIYGECGVC 
PRGQRTNAQKYCQPCTESPELYDWLYLGFMAMLPLVLHWFFIEW 
YSC-KKSSSALFQHITALFECS.MAAIITLLVSDPVGVLYIRSCRV 
LMLSDWYTMLYNPSPDYVTTVHCTHEAVYPLYTIVFIYYAFCLV 
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ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, ?=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine , R=Arginine, 
S=3erine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








LMMLIRPLLVKKIACGLGKSDRFKSIYAALYFFPILTVLQAVGG 
GLLYYAFPYI ILVLSLVTLAVYMSASE I ENCYDLLVRKKRL I VL 
FSHWLLHAYG 1 1 £ I SRVDKLEQDLjPLLALVPTPALFYLFTAKFT 
EPSRILSEGANGK 


5808 


2 


433 


SLPDSGWEYLSKGGVADNHKDFGELRYNECLMNFSCNGECNGSS 
EGRITHGFQLKSAYENNLMPYTNYTFDFKGVIDYIFYSKTHMNV 
T,GVT,GPT,DPQWLVEN1IITGCPHPHIPSDHFSLLTQLELHPPLLP 
LVNGVHLPNRR 


5809 


464 


2422 


I LVP3FQGI LHPGVYCALQS QHQAQELVAD I DECE VSGLCREGG 
RCVNTHGSFECYCMDGYLPRNGPEPFHPTTDATSCTEIDCGTPP 
EVPDGYIIGNYTSSLGSQVRYACREGFFSVPEDTVSSCTGLGTW 
ESPKLHCQEINCGNPPEMRHAILVGNHSSRLGGVARYVCQEGFE 
SPGGKITSVCTEKGTWRESTLTCTEILTKINDVSLFNDTCVRWQ 
INSRRINPKISYVISIKGQRLDPMESVREETVNLTTDSRTPEVC 
LALYPGTNYTVNISTAPPRRSHPAVIGFQTAEVDLLEDDGSFNI 
SIFNETCLKLNRRSRKVGSEKMYQFTVLGQRWYLANFSHATSFN 
FTTREQVP WCLDLYPTTDYTVNVTLLRS PKRHS VQ I T I ATPPA 
VKQTISNISGFNETCLRWRSIKTADMEEMYLFHIWGQRWYQKEF 
AQEMTFNISSSSRDPEVCLDLRPGTNYNVSLRALSSELPWISL 
TTQITEPPLPEVEFFTVHRGFLPRLRLRKAKEKN3PISSYQVLV 
LPLALQSTFS CDSEGAS S FFSNASDADGYVAAELLAKDVPDDAM 
EIPIGDRLY¥GEYYNAPLKRGSDYCIILRITSEWNKVRRHSCAV 
WAQVKDSSLMLLQMAGVGLGSLAWIILTFLSFSAV 


5810 


3 


1641 


KVFGTHKDHEVSTLDTAISAVKVQLAEFLENj (, A 81RIEAFVS 
EIESFFNTIEENCSKNEKRLEEQNEEMMKKV1AQYDEKAQSFEE 
VKKKKMEFLHEQMVHFLQSMDTAKDTLETIVREAEELDEAVFLT 
SFEEIMERLLSAMESTASLEKMPAAFSLFEHYDDSSARSDQMLK 
QVAVPDPPRLEPQEPNSATSTTIAVYWSMNKEDVIDSFQVYCME 
EPQDDDEWELVEEYRLTVKESYCIFEDLEPDRCYQVWVMAVNF 
TGCSLPSERAIFRTAPSTPVIRAEDCTVCWNTATIRWRPTTPEA 
TETYTLEYCRQHSPEGEGLRSFSGIKGLQLKVNLQPNDNYFFYV 
RAINAFGTSEQSEAALISTRGTRFLLLRETAHPALHISSSGTVI 
SFGERRRLTEIPSVLGEELPSCGQHYWETTVTDCPAYRLGICSS 
SAVQAGALGQGETSWYMHCSEPQRYTFFYSGIVSDVHVTERPAR 
VGI-LLDYNNQRLIFIKAESEQLLFIIRHRFNEGVHPAFALEKPG 
KCTLHLGIEPPDSVRHK 


5811 


1918 


851 


WDIEGAVRRYVQPFLNALGAAGNFSVDSQILYYAMLGVNPRFDS 
ASSSYYLDMHSLPHVINPVESRLGSSAASLYEVLNFLLYVPELA 
HSPLYIQDKDGAPVATNAFHSPRWGGIMVYNVDSKTYNASVLPV 
RVEVDMVRVMEVFLAQLRLLFGIAQPQLPPKCLLSGPTSEGLMT 
WELDRLLWARSVENLATATTTLTSLAQLLGKISNIVIKDDVASE 
VYKAVAAVQKSAEELASGHLASAFVASQEAVTSSELAFFDPSLL 
HLLYFPDDQKFAIYIPLFLPMAVPILLSLVKIFLETRKSWRKPE 
KTD 


5812 


5204 


2744 


GC-KQRCQRGRSCGAREEEVEPGTARPPPAASAMDASLEKIADPT 
LAEMGKNLKEAVKMLEDSQRRTEEENGKKLISGDIPGPLQGSGQ 
DMVSILQLVQNLMHGDEDEEPQSPRIQNI3EQGHMALLGESLGA 
YISTLDKEKLRKLTTRILSDTTLWLCRIFRYENGCAYFHEEERE 
GLAKICRLAIHSRYEDFWDGFNVLYnKKPVIYLSAAARPGLGQ 
YLCNQLGLPFPCLCRVPCHTVFGSQHQMDVAFLEKLIKDDIERG 
RLPLLLVANAGTAAVGHTDKIGRLKELCEQYGIWLHVEGVNLAT 
LALGYVSSSVLAAAKCDSMTMTPGPWLGLPAVPAVTLYKHDDPA 
LTLVAGLTSNKPTDKLRALPLWLSLQYLGLDGFVERIKHACQLS 
QRLQESLKKVNYIKILVEDELSSPVWFRFFQELPGSDPVFKAV 
PVPNMTPSGVGRERHSCDALNRWLGEQLKQLVPASGLTVMDLEA 
EGTCLRFSPLMTAAVLGTRGEDVDQLVACIESKLPVLCCTLQLR 
SEFKQEVEATAGLLYVDDPNWSGIGWRYEHANDDKSSLKSYPQ 
GENIHAGLLKKLNELESDLTFKIGPEYKSMKSCLYVGMASDNVH 
AAELVETIAATAREIEDNSRLLENMTEWRKGIQEAQVELQKAS 
3ERLLEEGVLRQIPWGSVLNWFSPVQALQKGRTFNLTAGSLES 
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ID 
NO: 


Predicted 
beginning 

location 
corresponding 

residue of 
sequence 


Predicted end 
nucleotide 

corresponding 
to first 

residue of 
amino acid 
sequence 


Amine acid segment containing signal peptide 
<A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine , R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion! 








T3PIYVYKAQGAGVTLPPTPSGSRTKQRLPGQKPFKRSLRGSDA " 

LSETSSVSHIEDLEKVERLSSGPEQITLEASSTEGHPGAPSPQH 

TDQTEAFQKGVPHPEDDHSQVEGPESLR 


5813 




699 


HRDGVSGSLERPLTDRSRTGAFAQQRGKMATAGGGSGADPGSRG 
LLRLLSFCVLLAGLCRGNSVERKIYIPLNKTAPCVRLLNATHQI 
GCQSSISGDTGVIHWEK3EELQWVLTDGPNPPYMVLLESKHFT 
RDLMEKLKGRTSRIAGLAVSLTKPSPASGFSPSVQCPNDGFGVY 
SNSYGPEFAHCREIQWNSLGNGLAYEDFSFPIFLLEDENETKVI 
KQCYQDHNLSQMGSAPTFPLCAMQLFSHMAWLSFSTAT\CMRRS 
SIQSTFSINPKIVCDPLSDYNVWSMLKPINTTGTLKPDDRVWA 
ATRLDSRSFFWW\APGAESAVASFVTQLAAAEALQKAPDVTTL 
PR1IVMFVFFQGETFD Y I GS SRMVYDMEKGKFPVQLENVDS FVEL 
GQVALRTSLELWMHTDPVSQKNESVRNQVEDLLATLEKSGAGVP 
AVI LRRPNQSQPLPPS S LQRFLRARN I SGWLADHSGAFHKTKYY 
QSIYDTAENINVSYPBWLEPLKE/ETWNFG*QDTAKALADVATV 
LGRALYELAGGTMFSDTVQADPQTVTRLLYG\FLIKANNSWFQS 
ILQGRDLRSYLG*RGLFQH\YIAV\SSPTNTIYV/VLQYALANL 
TGTWNLTREQCQDPSKVPSENKDLYEYSWVQGPLHSNETDRLP 
RCVRSTARLARALSEAFELSQWSSTEYSTWTESRWKDIRARIFL 
IASKELELITLTVGFC-ILIFSLIVTYCINAKADVLFIAPREPGA 
VSY 


5814 


asoo 


432 


ALKCRPRRVLAI LVGPVQPDKMAE KGAVAVCVRVEPLNSREESI, 
GETAQV YWKTHNNVI Y PVDG S KS FNFDRVLHGNE T PKNVYE A\ I 
AAPIIDSAIQGYNGTIFA\YGQT\ASGKTYTMMGSEDHLGVIPQ 
3QFHGHFSQKI * EVFLDREFLLRVSYMEIYNETITDLLCGTQKM 
KPLIIREDVNRNVYVADLTEEWYTSEMALKWITKGEKSRHYGE 
TKMNQRSSRSHTIFRMILESREKGEPSNCEGSVKVSHLNLVDLA 
GSERAAQTGAAGVRLKEGCNINRSLFILGQVIKKLSDGQVGGFI 
NYRDSKLTRILQNSLGGNPKTRIICTITPVSFDETLTALQFAST 
AKYMKNTPYVNEVSTDEALLKRYRKEIMDLKKQLEEVSLETRAQ 
AMEKDQLAQLLEEKDLLQKVQNEKIENLTRMLVTSSSLTLQQEL 
KAKRKRRVTWCLGKINKMKNSNYADQFNIPTNITTKTHKLSINL 
LREIDESVCSESDVFSNTLDT1SEIEWNPATKLLNQENIESELN 
SLRADYDNLVLDYEQLRTEKEEMELKLKEKNELDEFEALERKTK 
KDQEMQLIHEISNLKNLVKHREVYNQDLENELSSKVELLREKED 
QIKKLQEYIDSQKLENIKMDLSYSLESIEDPKQMKQTIjFDAETV 
ALDAKRESAFLRSENLELKEKI1KELATTYKQMENDIQLYQSQLE 
AKKKMQVDLEKELQSAFNEITKLTSLIDGXVPKDLLCNLELEGK 
ITDLQKELNKEVEENEALRKEVILLSELKSLPSEVERIjRKEIQD 
KSEELHIITSEKDKLF3EWHKESRVQGLLEEIGKTKDDLATTQ 
SNYKSTDQEFQNFKTLHMDFEQKYKMVLEENERMNQEIVNLSKE 
AQKFDSSLGALKTELSYKTQELQEKTREVQERLNEMEQLKEQLE 
NRDSPLQTVEREKTLITEKLQQTLEEVKTLTQEKDDLKQLQESL 
QIERDQLKSDIHDTVNMNIDTQEQLRNALESLKQHQETINTLKS 
KISEEVSRNLHMEENTGETKDEFQQKMVGIDKKQDLEAKNTQTL 
TADVKDNEriEQQRKIFSLIQEKNELQQMLESVIAEKEQLKTDL 
KENI3KTIENQEELRLLGDELKKQQEIVAQEKNHAIKKEGELSR 
TCDRLAEVEEKLKEKSQQLQEKQQQLLNVQEEMSEMQKKINEIE 

IS i c^KIJKELTLBHMETERLEIiAQKLNENYEEVKSITKERKVL 

KELQKSFETERDHLRGYIREIEATGLQTKEELKIAHIHLKEHQE 
TIDELRRSVSEKTAQIINTQDLEKSHTKLQEEIPVLHEEQELLP 
NVKKVSETQETMNELELLTEQS TTKDSTTLAR I EKERLRLNEKF 
QESQEEIKSLTKERDNLKTIKEALEVKHDQLKEHIRETLAKIQE 
SQSKQEQSLNMKEKDNETTKIVSEMEQFKPKDSALLRIEIEMLG 
LSKRLQESHDEMKSVAKEKDDLQRLQEVLQSESDQLKENIKEIV 
AKH1ETEEELKVAHCCLKEQEET INELRVNLSEKETE ISTIQKQ 
LiAIMDKLQNKIQEIYEKEEQLNIKQISEVQEKVNELKQFKEHR 
KAKD3ALQS I ES KMLELTNRLQE SQEEIQIMI KEKEEMKRVQEA 
LQIERDQLKENTKEIVAKMKSSQEKEYQFLKMTAVNETQEKMCE 
IEHLKEQFETQKLNLENIETENIRLTQILHENLEEMRSVTKERD 
DLRSVEETLCTERDQLKENLRETITRDLEKQEELKIVHMH.il KEH 
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SEQ 
ID 
NO: 


Predicted 

nucleotide 

corresponding 
to first 
amino acid 
residue of 

sequence 


Predicted end 
nucleotide 

corresponding 
to first 
amino acid 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=e/steine, D=Aspartic Acid, E= 
Glutamic Acid, ^Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=01utamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








QETIEKLRGIVSEKTNEISNMQKDLEHStlDALKAQDLKIQEELR 
IAHMHLKEQQETIDK1RGIVSEKTDKLSNMQKDLENSNAKLQEK 
IQELKANEHQLITLKKDVMETQKKVSEMEQLKKQIKDQSLTLSK 
LEIENLNLAQKLKENLEEMKSVMKERDNLRRVEETLKLERDQLK 
E S1QETKARDLE I QQELKTAP.MLS KEHKETVDKLREKISEKT I Q 
ISDIQKDLDK3KDELQKKIQELQKKELQLLRVKEDVNMSHKKIN 
EMEQLKKQFEPNYLCKCEMDNFQLTKKLHESLEEIRIVAXERDF, 
LRR I KES LKMEREQF" ATLREM IARDRQNHQVKPEKRLLSDGQQ 
HLMESLREKCSRIKELLKRYSEMDDHYECLNRLSLDLEKEIEFH 
RIMKKLKYVLSYVTKIKEEQHECINKFEMDFIDEVEKQKELLIK 
IQHLQQDCDVPSRELRDLKLNQNMDLHIEEILKDFSESEFPSIK 
TEFQQVLSNRKEMTQFLEEWLNTRFDIEBILKIvIGIQKENDRICQV 
NNFFNNRIIAIMNESTEFEERSATISKEWEQDLKSLKEKNEKLF 
KNYQTLKTSLASGAQTOPTTCDNKNPHVTSRATQLTTEKIRELE 
NS LHE AKE S AMHKE S KI I KMC KELE VTND 1 I AKLQAKVHE SNKC 
LEKTKETIQVLQDKVALGAKPYKEEIEDLKMKLGKIDLEKMKNA 
KEFEKEISATKATVEYQK3VIRLLRENLRRSQQAQDTSVISEHT 
DPQPSNKPLTCGGGSGI VQNTKALI LKSEH IRLEKE I SKLKQQN 
EQLIKQKNELLSNNQHLSNEVKTWKERTLKREAHKQVTCKWCFL': 
SPKVTGTASKKKQITPSQCKERNLQDPVFKESPKSCFFDSRSKS 
LPSPHPVRYFDNSSLGLCPEVQNAGA3SVDSQP\GPKARLFQGK 
DVP\ECKTQ 






1460 


SELVMWTVQNRESLGLLSFPVMITMVCCAHSTNEPSKMSYVKET 
VDRLLKGYD IRLRPD FGGP PVDVGMR I DVAS I DMVSE VMMD YTL 
TMYi'UUSKKDKRLSYSGIPLNLTLDNRVADQLWVPDTYFLNDKK 
S F VHGVTVKNRM I RLHP DGTVL YGLR I TTTAACMMDLRRYP LDE 
QNCTLEIESYGYTTDDIEFYWNGGEGAVTGVNKIELPQFSIVDY 
KMVSKXVEFTTGAYPRLSLSFRLKRNIGYFILQTYMFSTLITIL 
SWVSFWINYDASA^RVALGITTVLTMTTISTHLRETLPKIPYVK 
AI D I YLMGCFVFVFLALLE YAFVNY I FFGKGPQKKGASKQDQSA 
NEKNKLEMNKVQVDAHGNILLSTLEIRNETSGSEVLTSVSDPKA 
TMYSYDSASIQYRKP1SSRE\A*GRAPDRHGVPSKGRIRRRAS\ 
QLKVKIPDLTDVWS IDKWSRMFFPITFSLFNWYWLYYVH 


5816 


861 


191 


TSSRSRAAAQEGDAETPGSVERRGRRAGAEDGMSQAPGAQPSPP 
TVYHERQRLELCAVHALNNVLQQQLFSQEAADE I CKRLAPDS RL 
NPKRSLLGTGNYD , /NVIMAAIjQGLGLAAVWWDRRRPLiSQIiAL?Q 
VLGL I1JILPS PVS LGLLS LPLRRRHLRWPCARL/VTVS YYNLDS 
K\LRAPEGPGGLRTE\*GPFLAAALAQGLCEVLLVVTKEVEEKG 
SWLRTD 


5817 


851 




RGPGANRGRSCRGCSGGREPSGGALPKRHCPC*PPSPPAAD 
VMSNTTVPNAPQANSDSMVGYVLGPFFLITLVGVWAWMYVQK 
XKRVDRLRHHLLPMYSYDPAEELHEAEQELLSDMGDPKW\QAG 

'i "TSGGHCWMSRRDLTPLPHPSEPGVLDCLGPCHLLPLLSP 
GSPCWVLGLHFSLHPPSAASASHALTITSLPPGLLPFVGVELTA 
HPQALMGRGFPSGMAAAGRHLCFL 




3 


3918 


> - t nrlwiflvqsfyavrhteswklmstddqqkiqaaafdkgd 
h 1 jki'.pifsssqqrkqvsdsgdikikswrgnnkkecwsylst 
nkkmksdglgasghssstnrwsinktlkqddvkekdgtkiaski 
t::elktgg:<nvsgkpktvtksktengdkarlenmsprqwersa 
taaaaatgqknllkgkgvrnqegqisgarpkvltgnlnvqakak 
plkkatgkdspclsiagpssrstdsskefsistecldepkengs 
teeekpsghklsfcdspgqmmknsvdsvknstvaiksrpvsrvt 
ngtsnkksiheqdtnvnnsvlkkvsgkgcsepvpqailkkrgts 
hgctaaqqrtkstpsnltktqgsqgespnsvkssvssrqsdenv 
akldhktttekqapkrkmvkqratalpkvnakivampknlnqsk 
kgetlnnkdskqkmppgqv1sktqpssqrplkhetstvqksmfh 
dvrdnknkdsvseqkphkplinlaseisdaealqsscrp\dpqk 
plndqekeklalecqniskldkslkheleskqicldksetkfpn 
hketddcdaanicchsvgsdntoskfysttalkymvshpnensl 
nsnpvcdldstsagqihlisdrenqvgrkdtnxqssikcvedvs 
lcnpertngtlxsaqedkkskvpvegltipsklsdesamdedkh 



378 



WO 01/53312 



PCT/US00/34263 



SEQ 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 

residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine ; V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possiiile nucleotide insertion) 








ATADSDVSSKCFSGQLSEKNSPKNMETSESPE3HETPETPFVGH 

GSNDRGISKCGTMLCHDFLGRSSSDTSTPEELKIYDSNIiRIEVK 
MKKQSSNDLFQVNSTSDDEIPRKRPEIWSRSAIVHSRERENIPR 
GSVQFAQE I DQVSSS ADETEDERSEAENVAENFS I SNPAPQQFQ 
GIINLAFEDATENECREFSAKKKFKRSVLLSVDECEELGSDEGE 
VHTPFQASVDSFSPSDVFDGISHEHHGRTCYSRFSRESEDNILE 
CKQNKGNSVCKNESTVLDLSSIDSSRKNKQSVSATEKKNTIDVL 
SSRSRQLLREDKKVNNGSNVENDIQQRSKFLDSDVKSQERPCHL 
DLHQREPNSDIPKNSSTKSLDSFRSQVLPQEGPVKESHSTTTEK 
ANIALSAGDIDDCDTLAQTRMYDHRPSKTLSPIYEMDVIEAFEQ 
KVESETHVTDMDF*DDQHFAKQDWTLLKQLLSEQDSMLDVTNSV 
PEDLSLAQYLIMQTLLLARDSSKPQGITHIDTLNRWSELTSPLD 
SSASITMASFSSEDCSPQGEWTILELETQH 


5819 


1 


5557 


AAAGLLGALHLVMTLWAAARAEKEAFVQSESriEVLRFDDGGL 
LQTETTLGLSSYQQKSISLYRGNCRPIRFEPPMLDFHEQPVGMP 
KMEKVYLHNPSSE*TITLVSIFATTSHFHASFFQNRKILPGGNT 
SFDVS/VFLARWGNVENTLFINTSNHGVFTY\OVFGVGVPNPY 
RLRPFIiGARVTVNSS FSP I INIHNPHSEPLQWEMYSSGGDI ,HL 
ELPTGQQGGTRKLWEIPPYETKGVMRASFSS5EADNHTAFIRIK 
TNASDSTEFIILPVEVEVTTAPGIYSSTEMLDFGTLRTQDLPKV 
T,NT,HLLNSGTKDVPITSVRPTPQ\NDAITVHFKPITLKAS\ESK 
YTKVASISFDAS KAKKPSQFSGKITVKAKEK.SYSKLEIPYQAEV 
LDGYLGFDHAATLFHIRDSPADPVERPIYLTNTF3FAILIHDVL 
LPEEAKTMFKVHNFSKPVLILPNESGYIFTLLFMPSTSSMHIDN 
NILLITNASKFHLPVRVYTGFLDYFVLPPKIEERFIDFGVLSAT 
EASNILFAIINSNPIELA1KSWHIIGDG\LSIELVAVDRGNRTT 
IISSLPECEKSSSSDQSSVTLASGYF\AVFRVKLTAKKL\EGIH 
DGAIQITTDYEILTIPVK\AVIAVGSLTCSPKHWLPPSFPGKI 
VHQSLNIMNSFSQKVKIQQIRSLSEDVRFYYKRLRGNKEDLEPG 
KKSKIANIYFDPGLQCGDHCYVGLPFLSKSEPKVQPGVAMQEDM 
WEADWDLHQSLFKGWTGIKENSGHRLSAIFEVNTDLQKNIISKI 
TAELSWPSILSSPRHLKFPLTNTNCSS\EEEITLENP/SQDVPV 
YVQFIPLALYSNPSVFVDKLVSRFKLSKVAKIDLRTLEFQVFRN 
SAHPLQSSTGFMEG\LSPHLILNLILKPGEKKEVKVK\FTPVHN 
RTVSSLIIVRNNLTVMDAVMVQGCGTTENLRVAGKLPGPGSSLR 
FKITEALLKDCTDSLKLREPNFTLKRTFKVENTGQLQIHIETIE 
ISGYSCEGYGFKVTOCQEFTLSANASRDIIILFTPDFTASRVIR 
ELKFITTSGSEFVFILNASLPYHMLATCAEALPRPNWELALYII 
ISGIMSALFLLVIGTA\YLEAQGIWEP\FRRRLS\FEASNPPFD 
VGRPFDLRRIVG1SSEGNLNTLSCDPGHSRGFCGAGGSSSRPSA 
GSHXQ*G?SGHPHSSHSNRNSADVDDVRAYNSGRTSSMTSAQAA 
SSQPANKTRPLVLDSNTGAQGHSAGRKSKGAKQSQHGSQHHAHS 
PLECHPQPPLPPPVPQPQEPQPERLSPAPLAHPSHPERASSARH 

KPLQRKVKPPKKQEEKEKKGKGKPQEDELKDSLADDDSSSTTTE 

TSNPDTEPLLKEDTEKQKGKQAMPEKHESEMSQVKQKSKKLLNI 

KKEIPTDVKPSSLELPYTPPLESKQRRNLPSKIPLPTAMTSGSK . 

SRNAQKTKGTSKLVDNRPPALAKFLPNSQELGNTSSSEGEKDSP 

PPEWDSVPVHKPGSSTDSLYKLSLQTLNADIFLKQRQTSPTPAS 

PSPPAAPCPFVARGSYSSIVWSSSSSDPKIKQPNGSKHKLTKAA 

S :_, P 3 GWP TFAAVTAG YDKS PGGNGFAKVS SNKTGFS S S LG I S 

HAPVDSDGSDSSGLWSPVSNPSSPDFTPLNSFSAFGNSFNLTGE 

VF3KLGLSRSCNQASQRSWNEFNSGPSYLWESPATDPSPSWPAS 

SGSPTETATSVLGNTSGLWSTTPFSSSIWSSNLSSALPFTTPAN 

TLASIGLMGTENSPAPHAPSTSSPADDLGQTYNPWRIWSPTIGR 

RSSDPWSNSHFPHEN 


5820 


310 


1270 


RVSLSGPVS LGVLLCARS STMGKRDNRVAYMNP I AMARS RGP I Q 
S SGPTIQ\VI * IDQGLPGKK* KSN*KRKRK/DSKALAEFEEKMN 
ENWKKELEKHREKLLSGSESSSKKRQRKKKEKKKSW*\DSSSS\ 
SSSSDSSSSSSDSEDEDKKQGKRRKKKKNRSHKSSESSMSETES 
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SEQ 
ID 
NO: 


Predicted 
beginning 

corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 

corresponding 
to first 

sequence 


Amino acid segment containing signal peptide 
<A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leueine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








DSKDSLKKKKKSKDGTEKEKDIKGLSKKRKMYSEDKPLSSESLS 
E S E Y 1 3 E VRAKKKKS S E EREKATE KT KKKKKHKKHS KKKKKKAA 
SSSPDSP*H*EKS3FPYKESAMSEEISWKTTTYLLKCMNFLVF 
GIIPGLFSSHSDATV 


5B21 


179 


915 


KKRNQSWRWPKPGTNWMLSCSVCWRRVTWTGSVWMKKLGKHPQT 
PT/IKDCSIAATGKRPSARFPHQRRKKRREMDDGLAEGGPQRSN 
TYl r IKLFDRSVDLAQFSEHTPLYPICRAWMRNSP3VRERECSPS 
SPLPPLPEDEEG\SEVTNSKSR*CVQACPPTHTPGGQPKNACR\ 
SR1PSPLAALRMQGTP*RWSPFEPEPSPSTLIYRNMQRWKRIRQ 
RKKEASHRNQLRYSESMKILREMYERQ 


5322 


464 


4379 


QTLKEMPIVMARDLEETASSSEDEEVISQEDHPCIMWTGGCRRI 
PVLVFHADAILTKDNNIRVIGERYHLSYKIVRTDSRLVRSILTA 
HGFHEVHPSSTDYNLMWTGSHLKPFLLRTLSEAQKVNHFPRSYE 
LTRKDRLYKNIIRMQHTHGFKAFEILPQTFLLPAEYAEFCNSYS 
KDRGPWIVKPVASSRGRG\VYLINNPNQISLEENILVSRYINNP 
LLIDDFKFDVRLYVLVTSYDPLVIYLYEEGLARFATVRYDQGAK 
NIRNQFMHLTNYSVNKKSGDYVSCDDPEVEDYGNKWSMSAMLRY 
LKQEGRDTTALMAHVEDLI I KTI ISAELAIATACKTFVPHRS SC 
FELYGFDVLIDSTIiKPWLLEVNLSPSLACDAPLDLKIKASMISD 
MFTWGFVCQDPAQRASTRPIYPTFESSRRMPFQKPQRCRPLSA 
SDAEMKKLVGSAREKGPGKLGGSVLGLSMEEIKVLRRVKEENDR 
RGGFIRIFPTSETWEIYGSYLEHKTSMNYMLP.TRLFQDRMTADG 
APELKI*SLNSKAKLHAALYERKLLSLEVRKRRRRSSRLRAMRP 
KYPVITQPAEMNVKTETESEEEEEVALDNEDEEQEASQEESAGF 
LRENQAKYTPSLTALVENTPKENSMKVREWNKKGGHCCKLETQE 
LEPKFMLMQILQDKGNLSKMQARIAFSAYLQHVQI\RLMKDSGG 
QTFSASWAAKEDEQMELWRFLKRASNNLQHSLRMVLPSRRLAL 
LERTRILAHQLGDFIIVYNKETEQMAEKKSKKKVEEEEEDGWM 
ENFQEFIRQASEAELEEVLTFYTQKNKSASVFLGTHSKISK1WN 
NYSDSGAKGDHPETIMEEVKIKPPKQQQTTEIHSDKL3RFTTSA 
EKEAKLVY3NSSSGPTATLQKIPNTHLSSVTTSDLSPGPCHHSS 
LSQIPSAIPSMPHQPTILLNTVSASASPCLHPGAQNIPSPTGLP 
RCRSGSHTIGPFSSFQSAAHIYSQKLSRPSSAKAGSCYLNKHHS 
G I AKTQKEGEDASLYS KRYNQSMVTAELQRLAEKQAARQYS PSS 
HINLLTQQVTNLNLATGIINRSSASAPPTLRPIISPSGPTWSTQ 
SDPQAP3NHSSSPGSR3LQTGGFAWEGEVENNVYSQATGWPQH 
KYHPTAGSYQLQFALQOLEQQKLQSRQLLDQSRARHQAIFGSQT 
LI 3 Li'TKJilNGAGCRISSATASGQKPTTIiPQKWPPPSSCASIj 
VPKPPPNHEQVLRRATSQKASKGSGAEGQLNGLQSSLNPAAFVP 
~ TS STDPAHTKIMNHKHTEKQPVHHSWVHD 


5823 


42 


2293 


LLTALSMEGGGGRDEPSACRAGDVNMDDPKKEDILLLADEKFDF 
DLSLSSSSANEDDEVFFGPFGHKERCIAASLELNNPVPEQPPLP 
TSE3P FAWS PLAGE KFVEVYKEAHLLALH IESSS RNQAAQAAKP 
EDPRSQGVERFIQESK?\KINLFEKEKEMKKSPTSLKRETYYLS 
DSPLLC-PPVGEPRLLASSPALPSSGAQARLTRAPGPPHSAHALP 
RESCTAHAASQAATQRKPGTKLLLPRAASVRGRGIPGAAEKPKK 
E I PAS FSRTKI PAEKESHRDVLPDKPAPGAVNVPAAGSHLGQGK 
RAIPVP\NKLGLKKTLLKAPGSYSN\LQRKSSSGA\VWSGASSA 
CTPQ?VAKAKSSEFASIPAN*LPGLCPNISKS\GRMGPAMLRPA 
L\PAGPVG\ASSWQAKRVDVSELAAEQLTAPP\SASPTQPQTPE 
i . \ WLNSSCAWSESSQLHKTRSIRRRDSCLNSKTKVMPTPTN 
QFKIPKFSIGDS\PDSSTPKLSRAQRPQSCTSVGRVTVHSTPVR 
P.SSGPAPQSLLSAWRVSALPTPASRRCSGLPPMTPKTMPRAVGS 
PL\CVPARRRSSEPRKKSAMRTEPTRESNRKTDSR\LYDVSPDR 
GSPPSRVPQALNFSPEESDSTFSKSTATEVAREEAKPGGDAAPS 
EALLVDIKLEPLAVTPDAASQPLIDLPLIDFCDTPEAHVAVGSE 
SRPLIDLMTNTPDKNKWAKPSPWGQLIDLSSPLIQLSPEADK 
ENVDSPLLKF 


5824 




2293 


LLTALSMEGGGGREEPSACRAGDVMMDDPKKEDILLLADEKFDF 
DLSLSSSSANEDDEVFFGPFGHKERCIAASLELNNPVPEQPPLP 
TSESPFAWSPLAGEKFVEVYKEAHLLALHIESSSRNQAAQAAKP 
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SEQ 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 
amino acid 


Predicted end 
nucleotide 
location 
corresponding 

residue of 
amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Fhenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Prcline, Q=Glutamine, H=Arginine, 
S=3erine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosins, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








EDPRSQGVERFIQESKF\KINLFEKEKEMKKSPTSLKRETYYLS 
DSPLLGPPVGEPRLIASSPALPSSGAQARLTRAPGPPHSAHALP 
RES CTAHAAS QAATQRKPGT KLLLPRAAS VRGRG I PGAAEKP KK 
E I PASPSRTKI PAEKESHRDVLPDKPAPGAVNVPAAGSHLGQGK 
RAIPVP\NKLGLKKTLLKAPGSYSN\LQRKSSSGA\VWSGASSA 
CTPQPVAKAKSSEFASIPAN*LPGLCPNISKS\GRMGPAMLRPA 
L\PAGPVG\ASSWQAKRVDVSELAAEQLTAPP\SASPTQPQTPE 
GGG\QWLNSSCAW£ESSQLNKTRSIRRRDSCLN£KTKVMPTPTN 
QFKIPKFSIG3S\PDSSTPKLSRAQRPQSCTSVGRVTVHSTPVR 
RSSGPAPQSLLSAWRVSALPTPASRRCSGLPPMTPKTMPRAVGS 
PL\CVPARRRSSEPRKNSAMRTEPTRESNRKTDSR\LVDVSPDR 
GSPPSRVPQALNFSPEESDSTFSKSTATEVAREEAKPGGDAAPS 
EALLVDIKLEPLAVTPDA.ASQPLIDLPLIDFCDTPEAHVAVGSE 
SRPLIDLMTNTPDMNKNVAKPSPWGQLIDLSSPLIQLSPEADK 
ENVDSPLLKF 


5825 


2 


4210 


flqiesaspapfssgflaahphspggslatkgrsrlsapgmlhl 
saappapppevtatarpclcsvgrrgdggkmaaagaler3fvel 
sgaererprhfreftvcsigtanava3avkysesaggfyyvesg 
klfsvtrnrfihwktsgdtlelmeesldinllnnairlkfqncs 
vlpggvyvsetqnrvi ilmltnqtvhrlllphpsrmyrselwd 
sqmqsiftdigkvdftdpcnyql;pavpgispkstastawlssd 
gealfalfcasggifvlklppydzpgmvswelkqssvmqrllt 

RWMPTATRGDQSPSDRPLSLAVHCVEHDAFIFALCQDHKLRMWS 
YKEQMCLKVADMLEYVPVKKDLRLTAGTGHKLRLAYSPTMGLYL 
GIF\MHAPKRGQFCIFQLVSTESNRYSLDIIISSI,FTSQETLIDF 
ALTSTDIWALWHDAENQTWKYINFEHNVAGQWNPVFMQPLPEE 
EIVIRDDQDPREMYLQSLFTPGQFTN3ALCKALQIFCRGTERNL 
DLSWSELKKEVTLAVENELQGSVTEYEFSQEEFRNLQQEFWCKF 
YACCLQYQEALSHPLALHLNPHTNMVCLLKKGYLSFLIPSSLVD 
HLYLLPYENLLTEDETTISDDVDIARDVICLIKCLRL1EESVTV 
DMSVIMEMSCYNLQSPEKAAEQ1LEDMITIDVENVMEDICSKLQ 
EIRNPIHAIGLLIREMDYETEVEMEKGFNPAQPLNIRMMLTQLY 
GSNTAGYIVCRGVHKIASTRFLICRDLLILQQLLMRLGDAVIWG 
TGQLFQAQQDLLHRTAPLLLSYYLIKWGSECLATDVPLDTLESN 
LQHLSVLiELTDSGALMANRFVSSPQTIVELFFQEVARKHIISHL 
FSQPKAPLSQTGLNWPEMITAITSYLLQLLWPSNPGCLFLECLM 
GNCQYVQLQDY I QLLHPWCQVNVGS CRFMLGRC YLVTGEGQKAL 
ECFCQAASEVGKEEFLDKLIRSEDGEIVSTPRLQYYDKVLRLLD 
VIGLPELVIQLATSAITEASDDW\KSQATL\RTCIFKHHL\DLG 
\HNSQAYGSL*PQIPDSSRQLDCLRQLVWLCERSQLQDLVEFS 
YVNLHNEWGIIESRARAVDLMTHNYYELLYAFHIYRHNYRKAG 
TVMFEYGMRLGREVRTLRGLEXQGNCYLAALNCLRLIRPEYAWI 
VQPVSGAVYDRPGASPKRNHDGECTAAPTNRQIEILELEDLEKE 
CSLARIRLTLAQHDPSAVAVAGSSSAEEMVTLLVQAGLFDTAIS 
LCQTFKLPLTPVFEGLAFKC I KLQFGGEAAQAEAWAWLAANQLS 
SVITTKESSATDEAWRLLSTYLERYKVQNNLYHHCVINKLLSHG 
VPLPWWLINSYKKVDAAELLRLYUJYDLLDLTPYQVIRICGC 


5826 


3 


S71 


KSQLLRDHSAPPPKPCTSVGAMGC* PRQ / S PKEQQRQLKKQKNR 
AAAQRS RQ KHTDKADALHQQHES LE KDNLALRKE I QSLQAE LAW 
WSRTLHVHERLCPMDCASCSAPGLLGCWDQAEGLLGPGPQGQHG 
CREQLELFQTPGSCYPAQPLSPGPQPHDSPSLLQCPLPSLSLGP 
AWAEPPVQLSPSPLLFASHTGSSTiQGSSSKLSALQPSLTAQTA 
PPQPLELEHPTRGKLGSSPDNPSSALGLARLQSREHKPALSAAT 
WQGLWDPSPHPLLAFPLLSSAQVHF 


5827 


194 


2287 | 


GHGSENSALXSYTLREPPFTLPSGLAVYPAVLQDGKFASVFVYK 
RENEDKVNKAAKVP* *HLKTLRHPCLLRFLSCTVEADGIHLVTE 
RVQPLE VALETLS SAEVCAG I YD I LLAL I FLHDRGHLTHNNVCL 
SSVFVSEDGHWKLGGMETVCKVSQATPEFLRSIQSIRDPASIPP 
EEMSPEFTTLPECHGHARDAFSFGTLVESLLTILNEQVSADVLS 
SFQQTLHSTLLNPIPKWRPALCTLLSHDFFRNDFLEVVNFLKSL 
TLKSEEEKTEFFKFLLDRVSCLSEELIASRLVPLLLNQLVFAEP 
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NO: 
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nucleotide 

corresponding 
to first 

sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 

residue of 

sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E=' 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionlne, N=Asparagine, 
P=?roline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V-Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








VAV\KSFLPYLLGPKKDHAQGETPCLLSPALFQSRV_PVLLQLF 
EVHEEHVRMV jLSH lEAYVGALSLREQLKKVX IL \ PQVLLG \LR 
D \TSDS I VAI TLHSLAVLVS LLGPE VWGGERTKI FKRTAP \SF 
TK\NTDLSLEGDPFSQPIKFPINGLSDVKNTSEDSENFPSSSKK 
SEEWPDWSGPE\EPENQTVNI\QIWP\REP\CDDVKSQCTTLDV 
E ESSHDDCE PS SLDTKVNPGGGI TATKPVTSGEQKP I PALLS LT 
EESMPWKSSLPQKISLVQRGDDADQIEPPKVSSQERPLKVPSEL 
GLGEEFTIQVKKKPVKDPEMDWFADMIPEIKPSAAFLILPELRT 
EMVPKKDDVSPVMQFSSKFAAAEITEGEAEGWEEEGELNWEDNN 


5828 


2 


257 


AREGGSLGAVAACGELSYSCDFCPARPHTSW1TRFVKMEFQAVV 
MAVGGGSRMTDLTSSIPKPLLPVGNKPLIWYPLNLLERVGFEEV 
IWTTRDVQKALCAEFKMKMKPDIVCIPDDADMGTADSLRYIYP 
KLKTDV-LVLSCDLITDVALHEWDLFRAYDASLAMLMRKGQESI 
EPVPGQKGKKKAVEQRDFIGVDSTGKRLLFMANEADLDEELVIK 
GSILQKHPRIRFHTGLVDAHLYCLKKYIVDFLMENGN.SITSIRS 
EL\ I P YLV/RGKQFS SAS SQQGTRKEKEGGS KGKRGLKS FRISY 
S FY* KEAKYTGTGAP Y\ D \ ACW I 


5829 


2S0 


1259 


PDCRL I VC CCEDKTI KI WDTTNKQCVNNFSDSVGFANFVDFNPS 
GTCIASAC-SDQTVKVWDVRVNKLLQHYQVHSGGVNCISFHPSGN 
YLI TAS SDGTLKILDL.LKGRL I YTLQ3HTGPVFTVS FS KGGELF 
ASGGADTQVLLWRTNFDELHCKGLTKRNLKRLHFDSPPHLLDIY 
PRTPHPHEEKVETVEDFFLHLLRLIQSLR*SICRSLLPLLWISF 
LLILPQQQKPWGLCQTRVKRPVDIS*TLP*CHQNVCQQPRKRK 
IIPLAVTDALEHIMFQLITVLTQTVSILEQR 
LTLTEDKLKDCLENQQKLFSAVQQKS 


5830 


4496 


3139 


GGKMAAPEERDLTQEQTEKLLQFQDLTGIESMDQCRHTLKQHNW 
NIEAAVQDRLNEQEGVPSVFNPPPSRPLQYMTADHRIYSYWSR 
PQPRGLLGWGYYLIMLPFRFTYYTILDIFRFALRFIRPDPRSRV 
TDPVGDIVSFMHSFEEKYGRAHPVFYQGTYSQALNDAKRELRFL 
LVYLHGDDHQDSDEFCRNTLCAPEVISLINTRMLFWACSTNKPE 
GYRVSQALRENTYPFLAMIMLKDRRE * PV\ VGRL3GL I \QPDDL 
INQLTFIMDANQTYLVSERLEREERNQTQVLRQQQDEAYLASLR 
ADQEKERKKREERERKRRKKEEVQQQKLAEERRRQNLQEEKERK 
LECLPPEPSPDDPESVKIIFKLPNDSRVERRFHFSQSLTVIHDF 
LFSLKESP\EKFQIEA\NFPRR\VLPCIPSEE\WPNPPTLQE\A 
GLSHTEVLFVQDLTDE 


5831 


71 


2897 


FCSXDKCCLYLPDSINKSKSCmKPGAHSQDRHAVMDSERQVKD 
TDDIESPKRSIRDSGYIDCWDSERSDSLSPPRHGRDDSFDSLDS 
FGSRSRQTPSPDWLRGSSDGRGSDSESDLPHRKLPDVKKDDMS 
ARRTSHGEPKSAVPFNQYLPNKSNQTAYVPAPLRKKKAEREEYR 
KSKSTATSPAGLGKKALQDYGPRT\PVS\DDAESTSMFDMRCEE 
EAAVQPHSRARQEQLQLINNQLREEDDKWQDDLARWKSRKRSVS 
QDLIKKEEERKKMEKLLAGEDGTSERRKSIKTYREIVQEKERRE 
RELHEAYKNARSQEEAEGILQQYIERFTISEAVLERLEMPKILE 
RSHSTEPNLSSFLNDPNPMKYLRQQSLPPPKFTATVETTIARAS 
VLDTSMSAGSGSPSKTVTPKAVPMLTPKPYSQPKNSQDVLKTFK 
VDGKVSVNGETVHREEEKERECPTVAPAHSLTKSQMFEGVARVH 
C-SFLELKQDNGSIEINIKKPN3VPQELAATTEKTEPNSQEDKND 
G3KSRK3NIELASSEPQHFTTTVTRCSPTVAFVEFPSSPQLKND 
VSEEKDQKKPENEMSGKVELVLSQKWKPKSPEPEATLTFPFLD 
KMPEANQLHLFNLNSQVDSPSSEKSPVTTPFKFWAWDPEEERRR 
QEKWQQEQERLLQERYQ\KEQDK\LKEE\WEKAQKEVEEEERRY 
YEEEP * 1 1 \ EDPWPFTVS S S SADQLS TS SSMTEGSGTMNKIDL 
GNCQDEKQDRRWKKSFQGDDSDLLLKTRE3DRLEEKGSLTEGAL 
AHSGNPVSKGVHEDHQLDTEAGAPHCGTNPQLAQDPSQNQQTSN 
PTHSSEDVKPKTLPLDKSINHQIESPSERRKSISGKKLCSSCGL 
PLGKGAAMIIETLNLYFHIQCFRCG\ICKGQLGDAVSGTDVRIR 
NGLLNCNDCYMRSRSAGQPTTL 


5832 


2454 


829 


PGRRFRHGSCAFQKQCIMLHICQYFLQGECKFGTSCKRSHDFSN 
SENLEKLEKLGMSSDLVSRLPTIYRNAHDIKNKSSAPSRVPPLF 
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ID 
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to first 

sequence 


Predicted end 
nucleotide 
location 
cor re sponding 
to first 
amino acid 
residue of 
amino acid 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion! 








VPQGTSERKDSSGSVSPNTLSQEEGDQICLYHIRKSCSFQDKCH 
RVHFHLPYKWQFLDRGKWEDLDNMELIEEAYCNPKIERILCSES 
ASTFHSHCLNFNAMT YGATQARRLSTAS SVTKPPHF I LTTDWXW 
YWSDEFGSWQEYGRQGTVHPVTTVSSSDVEKAYLAY/WYTGV*R 
PGSHLEVPGRKAQLRVRFQSLRSEKP3LWHN*KGLPQTQIR\AP 
QDVTTMQTCNTKFPGPKSIPDYWDSSALPDPGFQKITLSSSSEE 
YQKVWNLFNRTLPFYFVQKIERVQNIiALWEVYQHQKGOMQKQNG 
GKAVDERQLFHGTSAIFVDAICQQNFDWRVCGVHGTSYGKGSYF 
ARDAAYSHHYSKSDTQTHTMFLARVLVGEFVRGNASFVRPPAKE 
GWSNAFYDSCVNSVSDPSIFVIFEKHQVYPEYVIQYTTSSKPSV 
TPSILLALGSLFSSRQ 


5833 


170 


3289 


SILCLLSPCWQFGKPWSILSSRSRHSPCTKKGWEGMRKHLHT 
RQGHK* VHVE ISKALWVYRDDYFI RHS I S VSAVI VRAWITHKYR 
GRDWOTKWEENLLHAVAKNYTLLQTIPPFERPFKDHCVCLEWNM 
GYIWNLRANRIPQCPLENDWALLGFPYASSGENTGIVKKFPRF 
RNREL3ATRRQRMDYPVFTVSLWLYLLHYCKANLCGILYFVDSN 
EMYGTP SVFLTEE3YLH I QMHLVKGEDLAVKTKF 1 1 PLKEWFRL 
DISFNGGQ I WTTS IGQDIKSYHNQT I S FREDFHYNDTAGYFI I 
GGSRYVAGIEGFFGPLKYYRLRSLHPAQIFNPLL3KQLAEQIKL 
YYERCAEVQE I VS VYAS AAKHGGE RQ 3ACHLHNS YLDLQRR YGR 
PSMCRAFPWEKELKDKHPSLFQALLEMDLLTVPRNQNESVSEIG 
GKIFEKAVKRLSSIDGLHQISSIVPFLTrSSCCGYHKASYYLAV 
FYETGLNVPRDQLQGMLYSLVGGQGSERLSSMNLGYKHYQGIDN 
YPLDWELSYAYYSNIATKTPLDQHTLQGEQAYVETIRLKDDEIL 
KVQTKEDGDVFMWU<HEATRGNAAAQQRLAQMLFWGQQGVAKNP 
EAAIEWYAKGALETEDPALIYDYAIVLFKGQGVKKNRRLALELM 
KKAASKGLHQAVNGLGWYYHKFKKNYA\KAAKYWLKA\EE\MGN 
PDASYNLGVLHLDGIFPGVPGRNQTLAGEYFHKAAQGGHHEGTL 
WCSLYYITGNLETFPRDPEKAVWAKHVAEKNGYLGHVIRKGLN 
AYLEGSWHEALLYYVLAAETGIEVSQTNLAHICEERPDLARRYL 
GVNCVWRYYNFSVFQIDAPSFAYLKMGDLYYYGHQNQSQDLELS 
VQMYAQAALDGD5QGFFNLALLIEEGTIIPHHILDFLEIDSTLH 
SNNISILQELYERCWSHSITEESFSPCSLAWLYLHLRLLWGAILH 
SALIYFLGTFLLSILIAWTVQYFQSVSASDPPPRPSQASPDTAT 
STAS PAVTPAADAS DQDQPTVTNNPE PRG 


5834 


17 


4020 


RFRRGGGRVFPGAFPASPSDSLGQGNSQGPPRTPKPPRT/QECG 
SAAPGPIPGQSSS*VPLRLEQIQQKADCPLSLELALKPRMAAQV 
TLEDALSNVDLLEELPLPDQQPCIEPPPSSLLYQPNFWTNFEDR 
NAFVTGIARYIEQATVHSSMNEMLEEGQEYAVMLYTWRSCSRAI 
PQVKCNEQPNRVEIYEKTVEVLEPEVTKLMNFMYFQRNAIERFC 
3EVRRLCHAERRKDFVSEAYLITLGKFINMFAVLDELKNMKCSV 
KNDHSAYKRAAQFLRKMADPQSIQESQNLSMFLANHNKITQSLQ 
QQLEVISGYEBLLADIVNLCVDYYENRMYLTFSEKHMLLKVMGF 
GLYLMDGSVSNIYKLDAKKRINLSKIDKYFKQLQWPLFGDMQI 
ELARYIKTSAHYEENKSRWTCTSSGSSPQYNICEQMIQIREDHM 
RFISELARYSKSEWTGSGRQSAQKTDAEYRKLFDLALQGLQLL 
SQWSAHVMEVYSWKLVHPTDKYSNKDCPDSAEEYERATRYNYTS 
EEKFALVEVIAMIKGLQVLMGRMESVFNHAIRHTVYAALQDFSQ 
VTLMEPLRQAIKKKKNVIQSVLQAIRKTVCDWETGHEPFNDPAL 
RGEKDPKSG*DIKVPRRAVGPSSTQLYMVRTMLESLIADKSGSK 
KTLRSSLEGPTILDIEKFHRESFFYTHLINFSETLQQCCDLSQL 
WFREFFLELTKGRRIQFPIEMSMPWILTDHILETKEASMMEYVL 
YSLDLYNDSAHYALTRFNKQFLYDEIEAEVNLCFDQFVYKLADQ 
IFAYYKVMAGSLLLDKRLRSECKNQGATIHLPPSNRYETLLKQR 
HVQLLGRS IDLKRLITQRVSAAMYKSLELAIGRFESEDLTS I VE 
LDGLLEINRMTHKLLSRYLTLDGFDAMFREANHNVSAPYGRITL 
HVFWELNYDFLPNYCYNGSTNRFVRTVLPFSQEFQRDKQPNAQP 
QYLHGSKALNLAYSSIYGSYRNFVGPPHFQVICRLLGYQGIAW 
MEELLKWKSLLQGTILQYVKTLMEVMPKICRLPRHEYGSPG1L 
EFFHHQLKDIVEYAELKTVCFQWLREVGNAILFCLLIEQSLSLE 
EVCDLLHAAPFQNILPRVHVKEGERLDAKMKRLESKYAPLHLVP 
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ID 
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amino acid 
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nucleotide 
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corresponding 

amino acid 

amino acid 
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Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Eistidine, I=Isoieucine , K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








L I F.P.LGTPQQ I AI AREGDLLTKERLCCGLSMFEVILTRI RS FLD 
DPIWRGPLPSNGVMHVDECVEFHRLWSAMQFVYCIPV3THEFTV 
3QCFGDGLHWAGCMIIVLLGQQRRFAVLDFCYHLLKVQKIIDGKD 
EIIKNVPLKKMVERIRKFQILNDEIITILDKYLKSGDGEGTPVE 
HVRCFQPPIHQSLASS 


5835 


4209 


1904 


SGNIRMAQ3SHQIDFQVLHDLRQKFPEVPEVWSRCMLQKNMKL 
DACCA VLSQESTRYLYGEGDLNFSDDSGISGLRNHMTSLKLDLQ 
SQNIYHHGREGSRMNGSRTLTHSISDGQLQGGQSWSELFQQEPQ 
TAPAQVPQ3FNVFGMSSSSGASNSAPHLGFHLGSKGTSSLSQQT 
PRFNPIMVTLAPNIQTGRNTPTSLHIHGVPPEVL1ISPQGNSIYI 
RPYITTPGGTTRQTQQHSGWV3QFNPMNPQQVYQPSQPGPWTTC 
PASNPLSHTSSQQPNQQGHQTSHVYMPISSPTTSQPPTIHSSGS 
SQSSAHSQYNIQNISTGPRKNQIEIKLEPPQRNNSSKLRSSGPR 
TSSTSSSVNSQTLNRNQPTVYIAASPPNTDELMSRSQPKVYISA 
NAATGDEQVMRNQPTLFISTNSGASAASRNMSGQVSMGPAFIHH 
HPPKSRAIGNKSATSPRWVTQPNT\EYTFKITVSPNKPPAVSP 
GWSPTFELTNLLNHPDHYVETENIHHLTDPTLAHVDRISETRK 
LSMGSDDAAYTQD I *RISNS WLGMVAKACYSSALGGQDGR 1 1 * A 
QEFETSWGNIWRLRLYRRF*NYAQMVAHTCSPaYSVD*ALLVHQ 
KARMERLQRELE IQKKKLDKLKSEVNEMENNLTRRRLKRSNS ZS 
QIPSLEEMQQLRSCNRQLQIDIDCLTKEIDLFQARGPHFNPSAI 
HNFYDNIGFVGPVPPKPKDQRSIIKTPKTQDTEDDEGAQWNCTA 
CTK] iNHPAIiTRCEQCEMPRJiF 


5836 


361 


2303 


FHITMCGICCSVNFSAEHFSQDLKEDLLYNLKQRGPNSSKQI.LK 
SDVNYQCLFSAHVLHLRGVLTTQPVEDERGNVFLWNGEIFSGIK 
VEAEENDTQILFNYLSSCKNESEILSLFS3VQGPWSFIYYQASS 
HYLWFGRDFFGRRSLLWHFSNLGKSFCLSSVGTQTSGLANQWQE 
VPAS\DP;? ^LIIiST i~)3FP~ A T "YNCILGNIFLGRILLKKMLIA* 
VKFQQTYQHLYQR»0MKPNCILKN1 J LFL*I*CCHKLHWRLIAVI 
FPMCHLQERYFKSFLLMYT*KEVIQQFIDVLSVAVKKRVLCLPR 
DENLTANEVLKTCDRKANVAILFSGGIDSMVIATLADRHIPLDE 
PrDLLNVAJIAEEKTMPTTFNREGNKQKNKCEIPSEEFSKDVAA 
AAADSPNKHVSVPDRITGRAGLKELQAVSPSRIWNFVEINVSME 
ELQKLRRTRICHLIRPLDTVLDDSIGCAVWFASRGIGWLVAQEG 
VKSYQENAKWLTGIGADEQLAGYSRHRVRFQSHGLEGliNKEIM 
MELGRISSRNLGRDDRVIGDHGKEARFPFLDENWSFLNSLPIW 
EKANLTLPRGIGEKLLLRLAAVELGLTASALLPKRAMQFGSRIA 
KMEKIX3KASDKCGRL.U1KSLENLSIEKETKIJ 


5837 


vm 


903 


NGNAVAQAPVTOCCYLATGSKDQTIRIWSCSRGRGVMILKLPFL 
KRRGGG ID PTVKERLWLTLHWPSNQPTQLVSS GFGGELLQWDLT 
QSWRRKYTLFSAESEGQNHSRIVFNLCPLQTEDDKQLLLSTSMD 
RDVKCWDIATLECSWTLPSLGGFAYSLAFSSVDIGSLAIGVGDG 
M I RVWHTLS IKNNYDVKNFWQGVKSKVTALCWHPTKEGCLAFGT 
DDGKVGLYDTYSNKPPQISSTYHKKTVYTLAWGPEVPPMSLGGE 
GDRPSLALYSCGGEGIVLQHNPWKLSGEAFDINKLIRDTNSIKY 
KLPVHTEISWKADGKIMALGNEDGSIEIFQ\IPNLKLZCTIQQH 
HKLVNT IS WHHE \ HGS PAQKLS YL \ MPSGSQQCS PFTCHNLKNC 
P* KAAPES PSDPLQS P YRTPPQGHTAQDYPVWAWEPHIH* WEGL 
VFCFPIDGYSP3CWD\AFPGKEAPVAIFRG\HQGRLLCVAWSPL 
DPDCIYSG\ADDFCVHKWLTSMQDHSRPPQGKKSIELEKKRLSQ 
PKAKPKKKKKPTLRTPVKLESIDGNEEESMKENSGPVENGVSDQ 
EGE3QAREPEI.PCGLAPAVSREPVICTPVSSGFEKSKVTINNKV 
ILLKKEPPKEKPETLIKKRKARSLLPLSTSLDHRSKEELHQDCL 
VLATAKHSRELNEDVSADVEERFHLGLFTDRATLYRMIDIEGKG 
HLENGHPELFHQLMLWKGDLKCVLQTAAERGELTDNLVAMAPAA 
GYHVWLWAVEAFAKQLCFQDQYVKAASHLLSIHKVYEAVELLKS 
NHFYREAIAIAKARLRPEDPVLKDLYLSWGTVLERDGHYAVAAK 
CYLGATCAYDAAKVLAKKGDAASLRTAAELAAIVGEDELSASLA 
LRCAQELLLANNWVGAQEALQLHESLQGQRLVFCLLELLSRHLE 
EKQLSEGKSSSSYHTWNTGTEGPFVERVTAVWKSIFSLDTPEQY 
QEAFQKLQNIKYPSATN1ITPAKQLLLHICHDLTLAVLSQQI>5ASW 
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Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=31ycine, 
H=Histidine, I=Isoleucine, K-Lysine, 
L=I>eucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine , V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possihle nucleotide insertion) 








DEAVQA1LRAWRSYDSGSFTIMQEVYSAFLPDGCDHLRDKLGD 
HQSPATPAFKSLEAFFLYGRLYEFWWSLSRPCPNSSVWVEAGHR 
TLSVEPGQQLDTASTEETDPETSQPEPNRPSEIiDLRLTEEGERM 
LSTFKELFSEKHASLQNSQRTVAEVQETLAEMIRQHQKSQLCKS 
TANGPDKNEPEVEAEQPLCSSQSQCKEEKNEPLSLPELTKRLTE 
ANQRMAKFPESIKAWPFPDVLECCLVLLLIRSHFPGCLAQEMQQ 
QAQELLQKYGtTTKTYRRHCQTFCM 


5838 


110 


98 


KTMPHLLVTFRDVAIDFSQEEWECLDPACRDLYRDVMLENYSNL 
ISLDLESSCVTKKLSPEKEIYEMES\PSGRIWGNVSTITFQYNG 
LGDNMECKGNLEGQVSKSEGLYMCVKITCEEKATESHSTSSTFH 
RI I /HYQGKI VKCKE CRQG FSYLS CL I QHEEMHN I * KCSEVNKH 
RNTFS KKPS YI * HQ\KFRLGEKPYECMECGKAFGRTSDLIQHQK 
IHTNEKPYQCNACGKAFIRGSQLTEHQRVHTGEKPYECKKCGKA 
FSYCSQYTLHQRIHSGEKPYECKDCGKAFILGSQLTYHQRIHSG 
EKPYECKECGKAFILGSHLTYHQRVHTGEKPYICKECGKAFLCA 
SQLNEHQRIHTGEKPYECKECGKTFFRGSQLTYHLRVHSGERPY 
KCKECGKAFISNSML1QHQR1HTGEKPYKCKECGKAFICGKQLS 
EHQRIHTGEKPFECKECGKAFIRVAYLTQHEKIHGEKHYECKEC 
GKTFVRATQLTYHQRIHTGEKPYKCKECDKAF/HLWLTILSEHQ 
RIHRGEKPYECKQCGR/LFIRGSHL/NEHLRTIITGEKPYECKEC 
GRAFSRGSEHTLHQRIHTGEKPYTCVQCGKDFRCPSQLTQHTRL 
HN*EYSSHKICMII3 IALASLDFAHLQEKNPEN 


5839 




2425 


GR?FPRPPRALPRLPLRGKRQDGRWTVDFEECLKD\SPRFRAAL 
EEVEGDVAELELKL\PXLVKLCIA\MIDTGKAFCVANKQFMNGI 
RD\LAQNS\NMDA\WETKFAPSFLDSLQEMINFHTIL/L*PNS 
E IN*GHS FQNFVKEDLRKFKDAKKQFENSQ * KRKKIALVKNAPV 
PSRPASLEL*KPPNILTATRKCFRHIALDYVLQINVLQSKRRSE 
ILKSMLSFMYAHLAFFHQGYDLFSELGPYMKDLGAQLDRLVGDA 
AKEKREMEQKHSTIQCKDFSRDDSKLKYNVDAANGrVMEGYLFK 
RASNAFKTWNRRWFSIQNNQWYQKKFKDNPTVWEDLRLCTVK 
KCEDIERRFCFEWSPTKSCMLQADSEKLRQAWIKAVQTSI\AT 
AYREKDDESEKLDXKSSPSTGSLDSGNESKEKLLKGESALQRVQ 
CIPGNASCCDCGLADPRWASINLGITLCIECSGIHRSLGVHFSK 
VRSLTLDTWEPELLKLMCELGNDVINRVYEANVEKMGIKKPQPG 
QRQEKEAYIRAKYVERKFVDKIFL*SLSPP\EQQKK\FVSKSSE 
EKRLSISKFGP\GDQVRASAQSSVRSNDS3IQQSSDDGRESLPS 
TVSANSLYEPEGERQDSSMFLDSKHLITPGLQLYRASYEKNLPKM 
AEALAHGADVNWANSEENKAT PL I QAVLG3S LVTCE FLLQNGAN 
'/NQRDVQGRGPLHHATVLGHTGQVCLFL.KRGANQHATDEEGKDP 
LS IAVEAANAD I VTLLRLARMNEEMRESEGLYGQPGDET YQD I F 
RDFSQMASNNPE KLNRFQQDS QKF 


5840 


698 


3610 


KHLELPRQHLTTLWQISSPRWRSPQRAFMSALSKTQTQSAPALQ 
GLSSLLQSVTGNPVPASEAASQSTSASPANTTVYTIKGRNLPSS 
AQPFIPKSFNYSPNSSTSEVSSTSASEASIGQSPGLPSTAFKLP 
SNTKGFTATHNTSPAAPPTEVTTCQSSEVSKPKL\ESESTSPSL 
\3MKIHNFLKGNPGFSVA*NLKHPNPAGSLGSSAPSESHPSDFQ 
RGPTSTSIDNIDGTPVRDERSGTPTQDEMMDKPTSSSVDTMSLL 
SKIISPGS5TPSSTRSPPPGRDESYPRELSNSVSTYRPFGLGSE 
SPYKQPSDGMERPSSLMDSSQEKFYPDTSFQEDEDYRDFEYSGP 
PPSAMMNLQKKPAKSILKSSKLSDTTEYQPILSSYSHRAQEFGV 
KSAFPPSVRALLDSSENCDRLSSSPGLFGAFSVRGNEPGSDRSP 
SPSKNDSFFTPDSNHNSLSQSTTGHLSLPQKQYPDSPHPVPHRS 
LFSPQNTLAAP TGHPPTS GVE KVLAS TISTTSTIEF KNMLKNAS 
RKPSDDKHFGQAPSKGTPSDGVSLSNLTQPSLTATDQQQQEEHY 
RIETRVSSSCLDLPDSTEEKGAPIETLGYHSASNRRM3GEPIQT 
VESIRVPGKGNRGHGREASRVGWFDLSTSGSSFDNGPSSASELA 
SLG3GGSGGLTGFKTAPYKERAPQFQESVGSFRSNSFNSTFEHH 
LPPSPLEHGTPFQREPVGPSSAPPVPPKDHGGIFSRDAPTHLPS 
VDLSNPFTXE AALAHAAP P P P PGEHS G I P F PTPP P P P ? PGEHS S 
SGGSGVPFSTPPPPPPPVDHSGWPFPAPPLAEHGVAGAVAVFP 
KDHSSLLQGTLAEHFGVLPGPRDHGGPTQRDLNGPGLSRVRESL 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
amino acid 


Predicted end 
nucleotide 

corresponding 
to first 
amino acid 
residue of 

sequence 


Amine acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Hlstidine, I=Isoleucine, K=L>ysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=?roline / Q=Glutamine , R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








TLPSHSLEHLGPPHGGGGGGGSNSSSGPPLGPSHRDTISRSGII 
LRSPRPDFRPREPFLSRDPFHSLKRPRPPFARGPPFFAPKRPFF 
PPRY 


5841 




762 


glrl.flvltvwpmmkpswlsrtefskrllcrtlwcqsgwssrsy 
trsmlkmttsinrrsrtstkstrtsarpgltatvsiglsdsptw 
rhcwmtarscsgekgghwaprqvgvyllpgrvgcvssrvspsfp 
gdg1dsglarrgravsalasglveepmlgppfhptprfkavsak 
skedlvsqgfteftiedfhntfmdlieqvekqtsvadllasfnd 
qstsdylwylrlltsgylqreskffehfieggrtvkefcqXqe 
\vepmgkesdhihiialaqglqrvhpgweymgprpraattnphi 
fp*glpspkvyllyrpg\hyeilykiglgssplgcpgcpllara 
lghcyrg fs wvkws yftpfflshdpppmfy 


5842 


307 




QEPTAJFKLRSTCGCGREMTCPDKPGQLINWFICSLCVPRVRKL 
WSSRRPRTRRN1,LLGTACAIYLGFLVSQVGRASLQHGQAAEKGP 
HRSRDTAEPSFPEIPLDGTLAPPESQGNC-STLQPNVVYITLRSK 
RSKPANIRGTVKPKRRKKHAVASAAPGQEALVGPSLQPQEA\EG 
KLML*HLC-TLREQTWLRLESDPGGWCGVRE/WRAGGFDFLQPSS 
RESNIRIYSESAPSWLSKDDIRRMRLLADSAVAGLRPVSSRSGA 
RLLVLEOGAPGAVLRCGPSPCGLLKQPLDMSEVFAFHLDRILGL 
NRTLPSVSRKAEFIQDGRPCPIILWDASLSSASNDTHSSVKLTW 
3TYQQLLKQKCWQNGRVPKPESGCTEIHHHEWSKMALFDFLLQI 
YNRLDTNCCGFRPKKEDACVQNGLRPKCDDQGSAALAHIIQRKH 
DPRHLVFIDNKGFFDRSEDNLMFKLLEGIKEFPASAVYVLKSQH 
LRQKLLQSLFLDKGYWESQGGRQGIEKLIDVIEHRAKILITYIN 
AHGVKVLPMNE 


5843 


500 




GTARLVTCWVLHGQ*VKKPAWEPGWWL*Q*RCRPKGWGLGAGM 
RGSRMSQPPQCLRRAQSSCCHFMVKLLDDGTFMIPGEKVAHTSL 
DALVTFHQQKP I EPRRELLTQPCRQKDPANVDYEDLFLYSNAVA 
EEAACPVSAPEEASPKPVLCHQSKERKPSAEM/RQNNHQGSHFL 
LPPKIPSWRDPPETLEEPQNAPRERPEGPAAAKKPPRHCELWT 
LGCPEIHGDLRPWDRKRQPRSLRGSHLGGQRLHGSLCGHISQKP 
LTAPGTKRQKG PHQEGRE VGQLH*GDPRGQELAPNGS ES P ILPG 
VQARAPGLGRA 




202 


2471 


FDSAVLSSINVMAVLPGPLQLijGVLLTISLSSIRLIQAGAYYGI 
KPLPPQIPPQMPPQIPQYQPLGQQVPHMPLAKDGLAMGKEMPHL 
Q YGKE Y PHLPQ YMKE I Q PAPRMGKEAVP KKGKE I P LAS LRGEQG 
PRGEPGPRGPPGPPGLPGHGIPGIKGKPGPQGYPGVGKPGMPGM 
PGKPGAMGMPGAKGEI3QKGEIGPMGIP*PQGPPGPHGLPGIGK 
PGGPGLPGQPGPKGDRGPKGLPGPQCLRGPKGDKGFGMPGAPGV 
KGPPGMHG?PGPVGLPGVGKPGVTGFPGP\QGPLGK\PGAPGEP 
GPQ3PIGV?GVQGPPGIPGIGKPGQDG\IPGQPGFPGGKGEQGL 
PGLPGPPGLPGIGKPGFPGPKGDRGMGGVPGALGPRGEKGPIGA 
PGI 3GP PGEPGLPGI PGPKGPPGAIGFPGPKGEGGI VGPQGPPG 
PKGEPC-LQGFPGKPGFLGEVGPPGMRGFPGP1GPKGEHGQKGVP 
GLPGVPGLLGPKGEPGIPGDQGLQGPPGIPGIGGPSGPIGPPGI 
PGPKGEFGLPGPPGFPGIGKPGVAGLHGPPGKPGALGPQGQPGL 
I ! PGP PGPPGPPAVMPPTPPPQGEYLPDMGLGIDGVKPPHAYG 
AKKGKNGGPAYEMPAFTAELTAPFPPVGAPVKFNKLLYNGRQNY 
NPQTGIFTCEVPGVYYFAYHVHCKGGNVWVALFKNNEPVMYTYD 
EYKKGFLDQASGSAVLLLRPGDRVFLQMPSEQAAGLYAGQYVHS 
SFSGYLLYPM 


5845 


215 


20S1 


HASNKSASLQDKMAMPKEKTAMCLWELARFNRVQPQYKLLNER 
GPAHSKMFSVQLSLGEQTWESEGSSIKKAQQAVGNKALTESTLP 
KPI*-KPPKSWVNNNPGCITPTVELNGLAMKRG\KPAIHRPLDPK 
P FFNNRANYNFQVMYNQRYHCP I PKIFYVQLTVGNKEFFGEGKT 
RQAARHNAAMKALQALQNEPIPERSPQNGESGKDMDDDKDANKS 
EISLVFEIALKRNKPVSFEVIKESGPPHMKSFVTRVSVGEFSAE 
C-EGNSKKLSKKRAATTVLQELKKLPPLPWEKPK\HFFKKRPKT 
IVKAGPEYGQGMNPISRLAQIQQAKKEKEPDYVLLSERGMPRRR 
EFWIQVKVGNEVATGTGPNKKIAKKNAAEAMLLQLGYKASTNLQ 
EQLEKT3ENKGWSGPKPGFPEPTNNTPKGILHLSPDVYQEMEAS 
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ID 
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corresponding 
to first 

residue of 

sequence 


Predicted er.d 
nucleotide 

corresponding 
to first 
amino acid 
residue of 
amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=?ossible nucleotide insertion) 








RHKVISGTTLGYLSPKDMNQPSSSFFSISPT3NSSATIARELLM 
NGTSSTAEAI3LKGSSPTPPCSPVQPSKQLEYLARIQGFQVHYC 
DROSGKECVTCr.TLAPVQMTFHArGSSIEASHDQV*YATAILLC 
YGPARKWKAIKMEAMCAHAALLSLIHYLLAPSARLEKSKLFALG 

N- 


584S 


1126 


45S 


FSKLIMKTFIIGISGVTNSGKTTLAKNLQKHLPNCSVISQDDFF 
KPESEIETDKNGFLQYDVLEALNMBKMMSAISCWMESARHSWS 
TDQESAEEIPILIIEGFLLFNYKPLDTIWNRSYFLTIPY3ECKR 
RRSTRVYQPPDSPGYFDGHVKPMYLKYRQEMQDITWEWYLEGT 
KSEEDLFLQVYEELIQELAKOKCLQVTA*RRNTTNPS/CK*IRK 
LQGVI 


5847 


2769 


505 


apemedlsspdstllqgghnllssasfqesvtfkdvivdftq.ee 
wkqldpgqrdlfrdvtlenythlvsiglqvskpdvisqleqgte 
pwimepsipvgtcadwetrlensvsapepdiseeelspevivek 
hkrddswssnllesweyegslerqqanqqtlpkeikvtektips 
wekgpvnnefgksviwssnlvtqepspeetstkrsikqnsnpvk 
keksckcnecgkafsycsalirhqrthtgekpykcn*/cvekaf 
srsenlinhqrihtgdkpykcdqcgkgfiegpsltqhqrihtge 
kpykcdecgkafsqrthlvqhqrihtgekrytcnecgkafsorc; 
hfmehqkihtgekpfkcdecdktftrsthltqhqkihtgektyk 
cnecgxafngpstfirhhmihtgekpyecnecgkafsqhsnltq 
hqktiitgekpydcaecgksfsywsslaqhlkihtgekpykcnec 
gkafsycssltqhrrihtrekpfecsechkafsylsnl.nqhqkt 
htqekayeckecgkafirssslakherihtgekpyqchecgktf 

^SYGSSLIQHRKIHTGEKPYKCNECGRAFMQNTHLTQHKRIHTGA 
KPYECASCGKAFRHCSSLAQHQKTHTEEKPYQCNKCEKTFSQSS 
HLTQHQRIHTGEKPYKCNECDKAFSRSTHLTQHQRIHTGEKPYK 
CNECGK\TFSQSTYLIQHQRIHSGEKPFGCNDCGKSFRYRSA^N 
KHQRLHPGI 


5848 


22 


2961 


AAPRRLLRGGDGDRTPRFPLPALLRPGPPAEAAPERRKMPAVSK 
GDGMRGLAVFISDIRNCKSKEAEIKRINKELANIRSKFKGDKAL 
3GYSKKKYVCKLLFIFLLGHDIDFGHMEAVNLLSSNRYTEKQIG 
YLFISVLVNSKSELIRLINNAIKNDLASRNPTFMGLALHCIASV 
GSREMAEAFAGEIPICVLVAGDTMDSVKQSAALCLLRLYRTSPDL 
VPMGDWTSRWHLLNDQHLGWTAATSLITTLAQKNPEEFKTSV 
SLAVSRLS\RIVTSASTDLQDYTY*FCPGFLGLSVKLLRLLQCY 
PPPDPAVRGRLTECLETILNKAQEPPKSKKVQHSNAKNAVLFEA 
ISLIIHHDSEPNLLVRACNQLGQFLQHRETNLRYLALESMCTLA 
SSEFSHEAVKTHIETVINALKTERDVSVRQRAVDLLYAMCDRSN 
AI Z r "M1SYLETADYSIREEIVLKVAILAEKYAVDYTW\YVD 
':'.Ll,[lLi.-ilAGDYVSEEVWYRVIQIVINRDDVQGYAAKTVFEALQ 
APAraENLVKVGGYILGEFGWLIAGDPRSSPLIQFHLLHSKFHL 
CSVPTRALLLSTYIKFVNLFPEVKPTIQDVLRSDSQLRNADVEL 
QQRAVEYLRLSTVASTDILATVLE3MPPFPERESSILAKLKKKK 
GP8TVTDLEDTKRDRSVDVNGGPEPAPASTSAVSTPSPSADLLG 
LGAAPPAPAGPPPSSGGSGLLVDVFSDSASWAPLAPGSEDNFA 
RFV^KNNGVLFENQLLQIGLKSEFRQNLGRMFIFYGNKTSTQFL 
NFTPTLICSDDLQPNLNLQTKPVDPTVEGGAQVQQWNIECVSD 

RWKQLSNPQQEVQNIFKAKHPMDTEVTKAKIIGFGSALLE3VDP 
NPANFVGAGIIHTKTTQIGCLLRLEPNLQAQMYRLTLRTSXEAV 
SQ31CELLSAQF 


5849 


3545 


1895 


[ 1 ^' ' <~) AGLE LLSSSIJPPS S AS RSAGI TGMRHQVQ 
P * DPCMSLSPPCFTEEDRFSLEALQTIHKQMDDDKDGG1EVEES 
DEFTREnMKYKDATNKHSHLHREDKHITIEDLWKRWKTSEVHNW 
TLEDTL2WLIEFVELPQYEKNFRDHNVKGTTLPRIAVKEPSFMI 
SQLKISDRSHRQKLQLKALDWLFGPLTRPPHNWMKDFILTVSI 
VIGVGGCWFAYTQNKTSKEHVAKMMKDLESLQTAEQSLMDLQER 
LEKAQEENRNVAVEKQNL*RKMMDEINYAKEEACRLRELREGAE 
CELSRRQYAEQELEQVRMALKKAEKEFELRSSWSVPDALQKWLQ 
L THE V S VQ Y YN I KRQNAE MQLA I AKD E AE K I KECKR S TVFGTLHV 
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ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 

residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 


Amine acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Ijeucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutaraine , R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








AHSSSLDEVDHKILEAKKALSELTTCLRERLFRWQQIEKICGFQ 
IAHNSGLPSLTSSLYSDHSWVVMPRVSIPPYPIAGGVDDLDEDT 
PPIVSQFPGTMAKPPGSLARSSSLCRSRRSIVPSSPQPQRAQLA 
PHAPHPSHPRHPHHPQHTPHSLPSPDPDILSVSSCPALYRNEEE 
EEAIYFSAEKQWEVPDTASECDSLNSSIGRKQSPP/SKPRDIPN 
IIS/DERYQEMRCP*EIPSGGIL 


5850 


3 


1895 


KAVLNFSASGSVISLTGSNPKHDASMWHLKKNGIIVYI.DVPLLN 
LICRLKLMKTDRIVGQNSGTSMKDLLKFRRQYYKKWYDARVFCE 
SGASPEEVADKVLNAIKRYQEVDSETFISTRHVWPEDCEQKVSA 
EFFIEAVIEGLASDGGLFVPAKEFPKLSCGEWKSLVGATYVERA 
QILLERCIHPADIPAARLGEMIETAYGENFACSKIAPVRHLSGN 
QFILELFHGPTGSFKDLS1QLMPHIFAQCIPPSCNYMILVATSG 
DTGSAVLNGFSRLNKNDKQRIAWAFFPENGVSDFQKAQIIGSQ 
RENGWAVGVESDFDFCQTAIKRIFNDSDFTGFLTVEYGTILSSA 
NS I NWGRLL PQ WYHAS AYLDL VS QGFI S FGS PVDVC I PTGNFG 
KILAAVYAKMMGIPIRKFICASNQNHVWTDFIKTG\HYDLRGKE 
N*AQTFFTVQ*IFLPNL£NLERHLHLMANKDGQLMTELFNRLES 
QHHFQIEKALVEKLQQDFVADWCSEGECIAAINSTYNTSGYILD 
PHTAVAKWADRVQDKTCPVIISSTAHYSKFAPAIMQALKIKEI 
NETSS3QLYLLGSYNALPPLHEALLERTKQQEKMEYQVCAADMN 
VLKSHVECLVQNQFI 


5851 


3120 


1802 


RCYLQ7LALLLTSTSARAAAAIAAAEEPAGSPSVMTKAGDHNKQ 
RGCCGSLADYLTSAKFLLYLGHSLSTWGDRMWHFAVSVFLVELY 
GNS LLLTAVYGL WAGS VLVLGA 1 1 GDWVDKNARLKVAQTS L W 
QNVSVILCGIILMMVFLHKHELLTMYHGWVLTSCYILIITIANI 
ANLASTATAITIQRDWIWVAGEDRSKLANMNATIRRIDQLTNI 
LAPMAVGQIMTFG3PVIGCGFISGWNLVSMCVEYVLLWKVYQKT 
PALAVKAGLKEEETELKQLNLHKDTEPKPLEGTHLMGVKDSNIH 
ELEHEQEPTCASQMAEPFRTFRDGWVSYYNQPVF/LGWHGSCFP 
LYDCPGL*LHHHRVRI J HSGTENFHPQYFD3SISYNWNNGNCSFY 
LATSKMWFGSDRSDLRIGTAFLFDLVCDLCIHAWKPPGLVRFSF 


5852 




422 


SSSYPSLPALLRARSAPGHCTHRSCGPEWRIDSISRLEMQGARR 
SGKAQAQPTILLLVPRLRKSLPSIWG/SLMGFFITSGPG/WFRQ 
YYFFISGRH*VLFTESDFYYVAMDFGGHGLSSHYSPGVPYYLQT 
FVSEIRRWAGKKQSVYFRRCGGCSRAPPLITGGGVGSRKQRWP 
ESGAWALAPGLPAIHGRSWES 


5G53 


223 


1346 


R-iLGLSRVKGLHGPAASAWISDPETKGDPGGPWGMWRGSDLRPR 
PVSLTGLTLVCK*AAQGPQV\HSVKLCFGLGG\PCLL\FPIFRP 
LLLHPRRPRLHPGTRGVAVEPHALRWHVAHGEEAGIRAAGPGH 
GGVEIPQG/VGSLGARRGLRPSRPSSRHRNRVPAPPPGRPLATP 
HRRRFPPDPALTCPGLGQDQGPREQQKQGSGRHDTILGDWGESE 
SRWVRGNFRTGTAATLIGFSRNPTLNGSENWGSLVSIQEEGPDT 
GWEREKRNPAEMGNPQRWASPIHTPPLGPEILRAMPEALRAMPE 
ALGLRPDPATSVPSAL8/QTF/PESWPRSCLRNQGETLGMGPVP 
LSSLCITESPSQNWTPCLLLLTCPRGLF 


5854 


86 


938 


KGRNTAPEKKGAALNNRENASS*NGY/SRNKQDIRRIENHIIQE 
LiCKLCAMIXRVLLERLENTRKLRELTEGRTLDWPQNRITEVSAK 
RQ I VTE YRE KGKRN+ E S KKRDLEGRSRR YNLCI I G I PETEDRAS 
GP.ETIKDLLE/ENFPELKNELDLQMEKAHRIPLKFNEKKAASRH 
IRVTFL/KFQRRNILQASSQRKQVTYKGAKVRLTSDFSPAILNA 
RRQW/K/P I SRVLRENNFEPRI I YSAKLSFLYKGNWKTFLD IQG 
LGKYINQELSLKILLKDLLQLTENLN 


5855 


53S 


2391 


IF . '^vAPSRISHLHK\FLFLLL?SLLMGYSESPPPITDSWAP 
FISLTHHVLSQSQSPLSSNCWICLSTHTQ*FTALPADLLTWTQS 
NVSI.HISYLAIPFLADSFLKPV/L*PGNSAKHLSFKLSSLSMVS 
GRAVALLHLIASGLTSIQTNTASSKPPIWGY\LSTQTSFISPPP 
LCLSRTYPNPAHATMVGQVPQSLCGLIFTL/RTPCRPSILHPNY 
K1ISTSAWQKVLCFSGSPTIHTSLHLTTGSSFLSFHPIPGFPAA 
NSALYVSSLKGPPGKNVTIPSPVTGT*QPPHRGSN/RLTVDKDN 
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ID 
NO: 
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corresponding 
to first 
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amino acid 


Predicted end 
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corresponding 
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Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine , 
P=Prol ine , Q=Glutamine , R=Arginine , 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








FFLS?KPIJSLHQLPSQ\TPYQALTGAALAGSYPIWEKENTLSWL 
PTFTYNFCLSTPSLFFLCDTN* YLCLPANWSGTCTLVFQAPTIN 
I LP PNQT I L 1 3 VEAS I S S 9 P I RNKWALHL I TLLTGLG I TAALGT 
GIAGITTSIT3YQTLFTTLSNTVEDMHTSITSLQRQLDFLVGVI 
LQNWRVLDLLTTEKGGTCIYLQEECCFCVNESGIVHIAVRRLHD 
RAAEL*HQVADSWWQGSSLLRWIPWVAPFLGPLIFLFLLLMIGP 
CIFNLVSRFISQRLNCFIQASMQKHIDNIFHLCHV*YQSLRGNH 
SEAPEPRP 


5856 


173 


1137 


PWLHGLGLSAVFLFYL*/YVTFHLYGGIILLLLIFIS1AGILYK 
FQDVLLYFPEQPSSSRLYVPMPTGIPHENIFIRTKDGIRLNLIL 
IRYTGDNSPYSPTI1YFHGNAGNIGHRLPNALLMLVNLKVNLLL 
VDYRGYGKSEGEASEEGLYLDSEAVLDYVMTSPDLDKTKIYLSG 
RSLG\GAAAIHLASDNSHRISAIMVENTFLSIPHIV1ASTLFSFFP 
MRYLPLWCYKNKFLSYRKISCCRMPSLFISGLSDQLIPPVMKKQ 
LYELSPSRTKRLAIFPDGTHNDTWQCQGYFTALEQFIKEVVKSH 
SPEEMAKTSSNVTII 


5857 




5S3 


KLIGKVLVLSWADAMAA7AVEPQGPALGSEPMMLGSPTSPKPG 
VNAQFLPGFLMGDLPAPVTPQPRSISGPSVGVMEMRSPLLAGGS 
PPQPVVPAHKDKS3APPVRSIYDDISSPGLGSTPLTSRRQPNIS 
VMQSPLVGVTSTP3TGQSMFSPASIGQPRKTTLSPAQLDPFYTQ 
GDSIiTSEDH\LDDSWGDCIWGFLKASA\SYILL\QFAQYGGIS* 
NMWMSNTGNWMHI RYQS KLQARKALS KDGRI FGE S 1KIGVKPCI 
DKSVM3SSDRCALSSPSLAFTPPIKTLGTPTQPG3TPRISTMRP 
LATAYKASTSDYQVI E DRQTPKKDES LVSKAME YMFGW 


5858 


3 55 


1419 


PPHQPAAASTSXHQQQQPPPPPQDSSKPVVAQGPGPAPGVGSAP 
PASSSAPPATPPTSGAPPGSGPGPTPTPPPAVTSAPPGAPPPTP 
PSSGVPTTPPQAGGPPPPPAAVPGPGPGPKQGPGPGGPKGGKMP 
3GPKPGGGPGLSTPGGHPKPPHRGGGEPRGGRQHHPPYHQQHHQ 
SPPPGGPGGRSEEKISGPRRGFKANLSLLRRPGEKTYTQRCRFC 
LLGIYLLISRRMNSRRLFAKIWENQEKFLSTKAKDSEFIKLESR 
ALA*NCPKPELG*YTP*GGRQLPSSLFPTHACLPLSCSVIFSPF 
MFPQ*IJCWGRKPFRPNLGPHLKGAVCNRWDDPWEGPTGKGHCLN 
FAS 


5859 


307 


1503 


GGSSARPRASSRRMLSRKKTKNEVSKPAEVQGKYVKKETSPLLR 
NLMPSFIRHGPTIPRRTDICLPDSSPCTAFSTSGDGWSRNQSFL 
RTPIQRTPHEIMRRESNRLSAPSYLARSLADVPREYGSSQSFVT 
3VSFAVENGDEGSRYYYSDNF7DGQRKRPLGDRAHEDYRYYEYN 
HDLFQRMPQNQGRHAS3IGRVAATSLGNLTNHGSEDLPLPPGWS 
VDNTMRGRKYYIDHNTNTTHWSHPLEREGLPPGWERVESSEFGT 
YYVDHTNKKAQY\RHP3APTCTSV*STTSCHI/AS/RQQTERNQ 
SLLVPANPYHTAEIPDWLQVYARAPVKYDHILKWELFQLADLDT 
YQGMLKLLFMKELEQIVKKYEAYRQALLTELENRKQRQQWYAQQ 




2956 


1270 


TIRVEEFPLCPGGGKAQbSSASLLGAGLLLQPPTPPPLLLLLFP 
LLLFSRLCGALAGPIIVEPHVTAVWGKNVSLKCLIEVNETITQI 
SWEKIHGKSSQTVAVHHPQYGFSVQGEYQGRVLFKNYSLNDAT1 
TLHNIGFSDSGKYICKAVTFPLGNAQSSTTVWLVEPTVSLIKG 
PDSLIDGGNETVAAICIAATGKPVAHIDWEGDLGEMESTTTSFP 
IIETATI ISQYKLFPTRFARGRRITCWKHPALEKDIRYSFILDI 
QYAP3VSVTGYDGNWFVGRKGVNLKCNADANPPPFKSVWSRLDG 
QWPDGLLASDNTLHFVHPLTFWYSGVYICKVT\NEPGSKEVTQK 
VHPTFQDPSLPTYPPLPALQFQWASPSTA*TSRD\LATEP*K1A 
PS PLSTL\ATIKGWTQLPTIIA* CSGVGALFI V\LVKCFGLGI F 
■ 'YE - F PTFRGDYFAKNYIPPSDMQKESQIDVLiQQDELDPYPDSV 
KKENI HPWINLIRKDYLEEPEKTQWNNVENLNRFERPMDYYEDL 
KMGMKFVSDEHYDENEDDLVSHVDGSVISRREWYV 


58S1 


2051 


1305 


RVCACVQAFWLVASSGDDSQGGDKCGCEVGSWVGSMR\nmARLL 
1 ^-PrACAAFAQQPAG/EPRRGLAGVGEGGPQCSWTOYRC 
TLEFLVSLLGTDLARGRGNSASGPTAPADSKQL/ML*DVHRRVI 
L3 * RMNSGS PA3DNAPSQRFCTNLSEGLRFG IS PSWREALYGCH 
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Predicted end 
nucleotide 

corresponding 
to first 
amino acid 

amino acid 
sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L= Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=31utamine, R=Arginine, 
S=Serine, T=Threonine, V= Valine, 
W=Tryptcphan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








A 


S8S2 


155S 


483 


FPFQLIMGEIKVSPDYNWFRGTVPLKKIIVDDDDSKIWSLYDAG 
PRSIRCPLIFLP PVSGTADVFFRQ I LALTGWGYRVIALQYPVYW 
DHLEFCDGFRKLLDHLQLDKVHLFGASLGGFLAQKFAEYTHKSP 
RVHSLILCNSFSDTSIFJ3QTWTANSFWLMPAFMLKKIVLGNFSS 
GPVDPMMADAIDFMVDRLESLGQSELASRLTLNCQNSYVEPHKI 
RDIPVTIMDVFDQSALSTEAKEEMYKLYPNARRAHLKTGGN'FPY 
LCRSAEVNLYVQIHL/R/RNSMEPNTRPLTHQWSVPRSLRCRKA 
ALASARRSSSVSLAVNDELTRCVLV*SVASAPVSRPFPSGSSGS 
PVLTVSGK 


5863 


7.714 


249 


PFPSRGSLPLAAPREDTMGPLMVLFCLLFLYPGLADSAPSCPQN 
VNISGGTFTLSHGWAPGSLLTYSCPQGLYPSPASRLCKSSGQWQ 
TPGATRSLSKAVCKPVRCPAPVSFENGIYTPRLGSYPVGGNVSF 
ECBDGF I \LRGS PVRQCRPNGMWDGETAVCDNGAGHCPNPGI SL 
GP\VRTGFRFGHGDKVRYRCSSNLVLTGSSERECQGNGVWSGTE 
PICRQPYSYDFPEDVAPALGTSFSHMLGATNPTQKTKESLGRKI 
QIQRSGHLNLYLLLDCSQSVSENDFLIFKESASLMVDRIFSF3I 
NVSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDH3N 
3TGTNTYAALNSVYLMMNNQMRLLGMETMAW\QEIRHAIILL\T 
DGK\SHMGGSPKTAVDHIREILNINQKRNDYLDIYAIGVGKLDV 
DWRELNELGSKKDGERIIAFILQDTKALHQVFEHMI.DVSKLTDTI 

cgvgnmsanasdqertpwhvtikeksqet\c\rgalisdqwvlt 
aahcfrdgndhslwrvnvgdpkscwgkefliekavispgfdvfa 
kkkqgil\efygd\diall\klackvkm\sthcqgpsclp\ctm 
\eanlgflretfkgstcr\dhenel/vwnkqsv\pahf\val\n 

GSKLEHLTLRMGVEWTSCCRGLSPKKKTM\FFNLT\DVRE\WT 
D\QFL\CS\GPQEDESP\CK*3\SGGA\VFLERRFRLSAGGVWC 
iWLL\YfJP\Ll t (lSA\nKNSPKKGPSVAKVPPPTR/DFHIN\LFP 
Q*SPWLRQHPGGMS i IFLPLLANGHLSPFACPARICRPLHFLPS 
EWATLRTL 


5864 


173 


1013 


PLISVPQSLISLPQPLLCFPGGQEPSAPSPCLYSFLWACSFTMG 
KLPPSIPPSSPLACVLKNLKPLQLTPDLKPKCLIFFCNTAWPQY 
KLDNDSK*PBKGTFEFSILQVLDNSCHKMGKWSEVPDVQAFF\S 
HWSLPSLCSQC/GLIPNLSSFSPFCSFG/PPPQVPSP/TESFFS 
MDSSDLPPSPQAAPRQAEPGPNSHLASAPPPYNPFIT3PFHTWS 
SLQFHSVT5PFPPAQQFTLKKVAGAKGIVKVSAPFSLSQIR*RL 
GSFSSNIKIQFSSWLIWQQP 


58S5 


568 




CLPGPRWGEGWRAGHTIVGCIFFKTAIISHFKGGMYLCVCMCTC 
LSVCVCVQVGSWICV/CVSMCACVSLCTC\ICRCISMYTREHAC 
ACTRV*VYMCMS/VCTCVSTCIDWVCAHVCVYMCLCLGYA*AC 
TCV*MCVCMHEHVCMC/VCACSCVLL/CRGHICM/MCMSAYICI 
/CVYVCVLCVWACMRMSTCVWLVYG*ACTCVWMHM/CSCTCR/C 
VHVCCMSMHACECLCVYLHICGCAGTRRWWAGSARGSRSCSRLP 
CWAPGPGLSLPGPSCPSVEQGLGGGPGQLQGRSGEARLGEHRGW 
GSPAAVCSRNCTVSPRRGADCFEAPDVPKQPPGWGRASFEERGC 
GGRGWVCAPPLNGPQCCCFSIKPELKAKKKK 


5866 




3197 


ARPEVPAPPAWLSRRGAAKMGDKKDDKDSPKKNKGKERRDLDDL 
KKEVAMTEHKMSVEEVGRKYNTDC VQGLTHSKAQE I LARDGPNA 
LTPPPTTPEWVKFCRQLFGGFSILLWIGAILCFLAYGIQAGTED 
DPSGDNLYLGIVLAAWIITGCFSYYQEAKSSKIMESFKNMVPQ 
QALVIREGEKMQVNAEEVWGDLVEIKGGDRVPADLRIISAHGC 
KVDNSSLTGESEPQTRSPDCTHE\NPLKTRNITFFSNNFVEGTA 
RGWVATGDRTVMGRIATLASGLEVGKTPIA1EIEHFIQLITGV 
AVFLGVS FFILSLILGYTWLEAVI FLIGI IVANVPEGLLATVTV 
CLTLTAKRflARKNCLVKNLEAVETLGSTSTICSDKTGTLTQNRM 
TVAHMWFDNQIHEADTTEDQSGTSFDKSSHTWVALF*H/LLGFC 
NRPVFKGGQDNI PVLKRDVAGDASESALLKCI ELS SGSVKLMRE 
RNKKVAEIPFNSTNKYQLSIHETEDPNDNRYLLVKKGAPERILD 
RCSTILLQGKEQPLDEEMKEAFQNAYLELGGLGERVLGFCHYYL 
PEEQFPKGFAFDCDDVNFTTDNLCFVGLMSM1GPPRAAVPDAVG 
KCRSAGIKVIMVTGDHPITAKAIAKGVGIIFEGNETVEDIAARL 
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SEQ 
ID 

NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 

residue of 
amino acid 
sequence 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 

sequence 


Amino acid segment containing signal peptide - 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K>Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, 
\=possible nucleotide insertion) 








NIPVSQWPRDAKACVIHGTDLKDFTSEQIDEILQNHTEIVFAR 
T3 PQQKL- 1 VEGCQRQGAIVAVTGDGWDSPALKKADIGVAMGI 
AG S DVS KQAADMI LLDDNFAS I VTCVEEGRLI FDNLKKS IAYTL 
TSNIPEITPFLLFIMANIPLPLGTITILCIDLGTDMVPAISLAY 
EAAESDIMKRQPRNPRTDKLVNERLISMAYGQIGMIQALGGFFS 
YFVILAENGFLPGNLVGIRLNWDDRTVNDLEDSYGQQWTYEQRK 
WE FTCHTAFFVS I VWQWAD LIT CKTRRNS VFQQGMKNKI L I F 
GLFEETALAAFLSYCPGMDVALRI-1YPLKPSWWFCAFPYSFLIFV 
YDEIRKLILRRNPGGWVEKETYY 


5867 


3 




LPGRRARGGRGLGWPPAQALDGSRMGKAKVPASKRAPSSPVAKP 
GPVKTLTRKKNKKKKRFWKSKAREVSKKPASGPGAWRPPKAPE 
DFSQNWKALQEWLLKQKSQAPEKPLVISQMGSKKKPKIIQQNKK 
ETSPQVKGEEMPAGKDQEASRGSVPSGSKMDRRAPVPRTKASGT 
EHNKKGTKERTWGDIVPERGEIEHKKRKAK\GQPQPHPPR/IDI 
WFDDVDPADIEAAIGPEAAKIARKQLGQSEGSVSLSLVKEQAFG 
GLTRALALDCEMVGVGPKGEESMAARVSIVNQYGKCVYDKYVKP 
TEPVTDYRTAVSGIRPENLKQGEELEWQKEVAEMLKGRILVGH 
ALHNDLKVLFLDHPKKKIRDTQKYKPFKSQVKSGRPSLRLLSEK 
ILCLQVQQAEIIC3IQDAQAAMRT.YVMVKKEWESMARDRRPLLTA 
PDHCSDDA*QSCPAAAAAPLQRQCDQSQGQITSPQSGNSGETFS 
ESWQRGVAWCY 


58S8 


2122 


833 


L 3 .^HTgDASUSTHAKVPAAAQML/CVTHAMREDLADIWYIR 
AVTVYDKPASFFKETPLDLQHRLFMKLGSMHSPFRARSEPEDPV 
TERSAFTERDAGS3LVTRLRERPALLVSSTSWTEDEEFSILLAA 
i ESE " :l T\MTLDGHNLPSLVCVITGKGPLREYYSRLIHQKHFQH 
IQVCTPWLEAEDYPLLLGSADLGVCLHTSSSGLDLPMKVVDMFG 
CCLPVCAVNFKCLHELVKHEENGLVFEDSEEIAAQLQMLFSNFP 
DPAGKLNQFRKNLRESQQLRWDESWVQTVLPLVMDT 


58SS 


2122 


833 


LTAGASHTQDASQSTSAKYPAAAQNL/CVTNAMREDLADIWYIR 

AVTVYDKPASFFKETPLDLQHRLFMKLGSMHSPFRARSEPEDPV 

TERSAFTERDAGSGLVTRLRERPALLVSSTSWTEDEDFSILLAA . 

LESRV*T\MTLDGHNLPSLVCVITGKGPLREYYSRLIHQKHFQH 

IQVCTPWLEAEDYPLLLGSADLGVCLHTSSSGLDLPMKVVDMFG 

CCLPVCAVNFKCLHELVKHEENGLVFEDSEELAAQLQMLFSNFP 

DPAGKLNQFRKNLRESQQLRWDESWVQTVLPLVMDT 


5870 


2122 


833 


LTAGASHTQDASQSTSAKYPAAAQNL/CVTNAMREDLADIWYIR 
AVTVYDKPASFFKETPLDLQHRLFMKLGSMHSPFRARSEPEDPV 
TERSAFTERDAGSGLVTRLRERPALLVSSTSWTEDEDFSILLAA 
LESRV*T\MTLDGHNLPSLVCVITGKCPLREYYSRLIIIQKHFQH 
I QVCTPWLEAEDYPLLLGSADLGVCLHTS SSGLDLPMKWDMFG 
CCLPVCAVNFKCLHELVKHEENGLVFEDSEELAAQLQMLFSNFP 
DPAGKLNQFRKNLRESQQLRWDESWVQTVLPLVMDT 


5871 


3 


3465 


FFFCRPLRLYSKTTGDRSAMAGAAGLTAEVSWKVLERRARTKRS 
VLKLL*LSLRRL*LEPTI*NGLLT*CSRLSVFRFLKV\GSVYEP 
LKSINLPRPDNETLWDKLDHYYRIVKSTLLLYQSPTTGLFPTKT 
CGGDQKAKIQDSLYCAAGAWALALAYRRIDDDKGRTHELEHSAI 
KCMRG I LYCYMRQADKVQQFKQDPRPTTCLHSVFNVHTGDELLS 
YEEYi 'IILQINAVSLYLLYLVEMISSGLQI IYNTDEVSFIQNLVF 
CV\ERVYRVP\DFG\VWGKREGKYY*/SGSTELHSSSVGLGKRQ 
L*KQFKGFNIjFGNQGCSWSVIFVDLDAHkRNRQTLCSLLPRESR 
SHNTDAALLPCISYPAFALDDEVLFSQTLDKWRKLKGKYGFKR 
FLRDGYRTSLEDPNRCYYKPAEIKLFDGIECEFPIFFLYMMIDG 
KQVQEYQDLLTPVLHHTTEGYPWPKYYYVPADFVEYE 
KNNPGSQKRFPSNCGRDGKLFLWGQALYIIAKLLADELISPKDI 
DPVQF.YVPLKD3RNVSMRFSNQGPLENDLWHVALIAESQRLQV 
FLNTYGIQTQTPQQVEPIOIWPOQELVKAYLQLGINEKLGLSGR 
PDRPIGCLGTSKIYRILGKTWCYPIIFDLSDFYMSQDVFLLID 
DIKNALQFIKQYWKMHGRPLFLVLIREDNIRGSRFNPILDMLAA 
LKKGIIGGVKVHVDRLQTLISGAWEQLDFLRISDTEELPEFKS 
FEELEPPKHSKVKRQSSTPSAPELGQQPDVNISEWKDKPTHEIL 
QKLNDCSCLASQAILLGILLKREGPNFITKEGTVSDHIERVYRR 
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SEQ 
ID 


Predicted 

nucleotide ' 

corresponding 

amino acid 
residue of 

sequence 


Predicted end 
nucleotide 

corresponding 
to first 

residue of 
amino acid 


Amino acid segment containing signal peptide 
(A=Alanine, OCysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stcp 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion) 








AGSQXLWSWRRAASLLSKWDSLAPSITNVLVQGKQVTLGAFG 
KEEEVISNPLSPRVIQNI I YYKCNTHDEREAVIQQELVIHIGWI 
ISNNPELFSGTLKIRIGWIIHAMEYELQIRGGDKPALDLYQLSP 
SEVKQLLLDILQPQQNGRCWLNRRQIDGSLNRTPTGFYDRVWQI 
LERTPNGIIVAGKHLPQQPTLSDMTMYEMNFSLLVEDTLGNIDQ 
PQYRQIWELLMWSIVLERNPELEFQDKVDLDRLVKEAFNEFQ 
KDQSRLKEIEKQDDMTSFYNTPPLGKRGTCSYLxEOWMNLLLEG 
EVKPNNDDPCLIS 


5872 


68 


665 


vqgymyrfvikinscysekts1crhrccpelpatqpkptptvff 
niaidseslgci\sfklfadkv/pkrwk™fvllntgekvlgdk 
gpcfyriipg\lcqggdfthhngtggkslyskefddenfl/likh 
tapgvlstanagpttngsqffictaktedg*qhwfgkvkdgms 
ivealersgsrngktskkitaancgql 


5873 


224 0 


5C6 


rrppeggsgggrrtrarmplpwslalplllswvaggfgnaasar 
hhgllasarqpgvchygtk1a.ccygwrrnskgvceatcepgckf 
3ecvgpnkcrcfpgytgktcsqdvnecgmkprpcqhrcvnthgs 
ykcfclsghmlmpdatcvnsrtcamincqyscedteegpqclcp 
ssglrlapngrdcldidecasgkvicpynrrcvntfgsyyckch 
igfelqyisgrydcidihectmdshtcshhancfntqgsfkckc 
kqgykgnglrcsai pensvkevlrapgtikdri kkllahknsmk 
kkakiknvtpeptrtptpkvnlqpfnyeeivsrggnshgg\kkg 
neekmkegledekreekabkd*hrrerpfkg\dvffpkvneage 
fglilWqrkaltsklehicadlnisvdcsfnhgVicdwXkqdrx 
eddfdw\npadr\dnai\gpy\mavpglwqghk\kdigrlklll 
pdlqpqsnjfcllfdyriagdkvgklrvfvknsnhaiawekttse 
dekwktgkiqlyqgtdatksiifeaergkgktgeiavdgvllvs 
glcpdsllsvdd 


5874 


2 


:-3 37 


ACPRLARRRRRVRSLRRRRGWLRARWSRGQNNMAARRITQETFD 
AVLQEKAKR YHMDAS GEAVS E TLCFKAQDLLRAVPRS RAEMYDD 
VHSDGRYSLSC-SVAHSRDAGRESLRSDVFSGPSFRSSNPSISDD 
SYFRKECGRDLEFSHSNSRDQVIGHRKLGHFRSQDWKFALRGSW 
EQDFGHPVSQESSWSQEYSFGPSAVLGDFGSSRLIEKECLEKE\ 
SRDYDVDHSG\EA\DSVLRGS\SQVQA\RGRALNIVDQEGSLLG 
. KGETQGLLTAKGGVGKLVTLRNVSTKKI PTVNRITPKTQGTNQI 
QKNTPS PDVTLGTNPGTED IQFPIQKI PLGLDLKNLRLPRRKMS 
FDI IDKSDVFSRFGI E 1 1 KWAGFHTI KDD I KFSQLFQTLFELET 
ETCAKMLASFKCSLKPEHRDFCFFTIKFIjKHSALKTPRVDHEFL 
NMLLDKGAVKTKNCFFE 1 1 KP FDKY IMRLQDRLLKS VTPLLMAC 
NAYEL£VKI1KTLSNPT,2)T,ALALETTNSI,CRKSLALLGQTFSLAS 
S FRQE KI L * AVGLQD I APS PAAFPNFEDSTLFGRE Y IDHLKAWL 
VSS3CPLQVKKAEPEPMREEEKMIPPTKPEIQAKAPSSLSDAVP 
QRADHRWGTIDQLVKRVIEGSLSPKERTLLKEDPAYWFLSDEN 
SLEYXYYKLKIAEMQRMSENLRGADQKPTSADCAVRAMLYSRAV 
!'.! - r KLLP\WQRRGLLRAQG\LRG\WKARRA\TTGTQTLLFLR 
APGLKHKGRQAPGLS\QAKPSLPDRND\AAKD\CPLDPV\GPSP 
QDP3 LEASGPSPKPAGVD ISEAPQTSS PCPSADIDMKDNGRTAE 
KLARFVAQVG\PEIEQF\SI\EWSTDNPDLWFL\HDQMSS\AFK 
FY\RKKVFELCPSICFTSSPHNL\HTGGGDTT\GSQESPVDLME 
GEA3FEDEPPPREAELESPEVMPEEEDEDDEDGGEEAPA\PGRG 

gpslegstpadglpgea\aeddl/algapalftgllqvtcfpfg 
kfllkvgmipapkrvcliqepkvhepvriaydrprgrpms 
e:kkxpkdldfaqqkl\tdk\nlgfq\mlokmgwkeghglgslgk 
gir\srsactqqaawggsgwglspstcslplgsftakmayswql 

IFVF 


5875 


296 


1848 


LAALG3LPLWRLSRRGFREYLLGLSAPSALGGAMRSVSYVQRVA 
LEFSGSLFPHAICLGDVDWDTLNELWGDTSGKVSVYKNDDSRP 
WLTCSCQGMLTCVGVGDVCNKGKNLLVAVSAEGWFHLFDLTPAK 
VLDASGHHETLIGEEQRPVFKQHIPANTKVMLISDIDGDGCREL 
WGYTDRWRAFRWEELGEGPEHLTGQLVSLKKWMLEGQVDSLS 
VTLGPL3LPELMVSQPGCAYAILLCTWKKDTGSPPASEGPTDGS 
/SGDP33PRRGAAPDIWPYPQQECLHSPNWQHQT\SHGTESSGS 
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SEQ 
ID 
NO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 

amino acid 

sequence 


Predicted end 

corresponding 
to first 
amino acid 
residue of 
amino acid 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, H= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T threonine , V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, * = Stop 
Codon, /-possible nucleotide deletion, 
\=possible nucleotide insertion/ 








glfa1ctldgtlklmeemeeadkllwsvqvdhqlfalekldvtg 
nghe3wacawdgqty11dhnrtwrfqvdenirafcaglyack 
egt?nspclvyvtfnqkiyvykevqlermestnlvkllet:<p\st 
taccr3wawilttsl*lvpcftkrstiqtshhsvlpqasripps 
wtcliagegff*tptlppkgvfgshcaaagsitkq 


587S 


1122 


224 


hlplgvpskvagaaamepqeeretovaawlkkifgdhpipoyev 
kprtt3ilhhlsernrvrdrdvylviedlkqkaseyeseakylq 
dllmeswfspanlsstgsrylnalvdsavaletkdtslasfip 
avhdltsdlfrtkskseeikielekleknltatlvlekclqedv 
kkaelhlster\akvdnrrqnm\dflkak£eefrfgiqaageql 
sargq \ dafsvp i qslval, irenwprlkqqt iplk\ kkles yld 
lmp\n?shcsk*rieeak\rela\sieaeltrrvs\mmel 


S877 


2030 


1907 


GTLGKMAASSSGEKEKERLGGGLGVAGGNSTRERLLSALEDLEV 
LSRELIEMLAISRNQKLLQAGEENQVLELLIHRDGEFQELMKLA 
LNQGKIHHEMQVLEKEVE KRDSDI QQLQKQLKEAEQ I LATAVYQ 
AKE KL KS I E KAR KGA I S S E E 1 1 KYAHR 1 S ASNAVCAP LT WVPGD 
PRRPYPTDLEMRSGLLGQMNNPSTNGVNGHLPGDALA/RRKIAR 
CPCSTVS/NGSQMTCR*INIILILQKSVCEL 


-,. 


350 


2113 


glwkcmqlqgpht:irvqp*ptprqqgpq\vpvaviagnrpnyly 
rmlrsllsaqgvspqmitvfidgyyeepmdwalfglrgiqhtp 
isiknarvsqhykasltatfnlfpeakfawleedldiavdffs 
flsqsihlleeddslycisawndqgybhtaedpallyrvetmpg 
lgwvlrrslykeelepkwptpeklwdwdmwmp.mpeqrrgrecii 
pdvsrsyhfgivglnmngyfheayfkkjikfntvpgvclpjwdsl 
kkeayevevhrllseaevldhsknpcedsflpdteghtyvafir 

MEKDDDFTTWTQLAKCLHIWDLDVRGNHRGLWRLFRKKNHFLW 
GVPASPYSVKKPPSVTPIFLEPPPKEEGAPGAPEQT 


5879 


3 


981 


RLTEAAAAGSGSRAAGWAGSPPTLLPLSPTSPRCAATMASSDED 
GTNGGASEAGEDREAPGKRRRLGFLATAWLTFYDIAMTAGWLVL 
AI AMVRF YME KGTHRGLYKS I QKTLKFFQTFALLE IVHCLIGIV 
PTSVIVTGVQVSSRIFMWLITHSIKPIQNEESWLFLVAWTVT 
3ITRYSFYTFSLLDHLPYFIKWARYNFFIILYPVGVAGELLTIY 
ARLPHVKKTGMFSIRLPNKYNVSFDYYYFLLITMASYIPLFPQL 
YFHMLRQRRKVLHG\G* L* KRMI K+ SLQTRCFFQNNQDYLS PS F 
NNKNKQLCEISWIVWFLKI 


5880 


113 8 


1324 


S LWCLVAGGLGLGPSSQNPLORAG ILARPREARGTFSALTACSA 
SVTSKGKSSSGMWPSAASDRDSPVPLRPPGPVQLPSGTGWVLSD 
*KKKRGRCSS/WLSQPQHEREKEWLLRRSMAEGERARAASDVL 
CRSLANETHQLRRTLTATAHMCQHLAKCLDERQHAQRNVGERSP 
DQSEHTDGHTSVQSVIEKLQEENRLLKQKVTHVEDLNAKKQRYN 
ASRDEYVRGLHAQLRGLQIPHEPELMRKEISRLNRQLEEKINDC 
AEVKQELAASRTARDABiLERVQMLEQQirAYKDDFMSERADRER 
AQSRIQELEEKVASLLHQVSWRQDSREPDAGRIHAGSKTAKYLA 
ADALELMVPGGWRPGTGSQQPEPPAEGGHPGAAQRGQGDLQCPH 
CLQCFSD3QGEELLRHVAECCQ 


5881 


26 




GGIHPSPTEAPRAQHLTMDCTWRILFLVAAATGTHAQVQLLQSG 
SEVKKPGASVMVSCYVSGYTLTKLSMHWVRQAPGRGLE*MGPFD 
LQDVETIYPQKFQGRVSMTEETSTETTQ/AYLELSSLR3EDTAV 
HHCATDTV 


5882 


2407 


2216 


SGCVEKLYSHSLEYKFPEWISVQSAVAPAQLALNSDGDL*LHSGE 
RTRRD*QLPEAGGPGLQEPLQLGELDITSDEFILDEVDG\VDLR 
HYSKQVELELQQIEQKSIRDYIQESENIASLHNQITACDAVLER 
MEQMLGAFQSDLS S I S SE IRTLQEQSGAMNIRLRNRQAVRGKLG 
ELVDGLWPSALVTAILEAPVTEPRFLEQLQELDAKAAAVREQE 
ARGTAACADVRGVLDRLRVKAVTKIRBFILQKIYSFRKPMTNYQ 
I PQTALLKYRFFYQFLLGNERATAKE I RDEYVETLS KIYLSYYR 
SYLGRLMKVQYEEVAEKDDLMGVEDTAKKGFFSKPSLRSRNTIF 
TLGTRGSVISPTELEAPILVPHTAQRGEQRYPFEALFRSQHYAL 
LDNSCREYLFICEFFWSGPAAHDLFKAVMGRTLSMTLKHLDSY 
LADCYDAIAVFLCIHIVLRFRNIAAKRDVPALDRYWEQVLALLW 
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SEQ 
ID 


Predicted 
nucleotide 
corresponding 

residue of 


Predicted end 

corr e sponding 

residue of 
sequence 


Amine ac.d segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine , 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








PRFELI^EMKVQSVRSTDPQRLGGLDTRPHYITRRYAEFSSALV 
SINQTIPNERTMQLLGQLQV2VENFVLRVAAEFSSRKEQLVFLI 
MNYDMMLGVLM \E* ERAADDS KEVES FQQLLNARTQEF I EELLS 
PPFGGLVAFVKEAEALIERGQAERLRGEEARVTQLIRGFGSSWK 
SSVESLSQDVMRSFTNFRNGTSIIQGALTQLIQ\LYHRFHRV\L 
SQPQLRA1PARAELINIHHLMVELKKHKPNF 


5883 




1374 


EFPGRRFRAVMEAGAGAGAGAAGWSCPGPGPTVTTLGSYEASEG 
CERKKGQRWGSLERRGMQAMEGEVLLPALYEEEEEEEEEEEEVE 
EEEEQVQKGGSVGSLSVNKHRGLSLTETELEELRAQVLQLVAEL 
EETRELAGQHEDDSLHLCGLLEDERLASAQQAEVFTKQ.IQQLQG 
ELRSLREEISLLEHEKESELKEIEQELHLAQAE1QSLRQAAEDS 
ATEHESDIASLQEDLCRMQNELEDMERIRGDYEMEIASLRAEME 
MK3SEPSGSLGLSDYSGLQEELQELRERYHFLNBEYRALQESNS 
SLTGQLADLESERTQRATERWLQSQTLSMTSAESQTSEMDFLEP 
DPEMQLLRQQLRDAEEQMHGi»IKNKCCELCCELEELQHHRQVSEE 
EQRRLQRELK3AQNE VLRFQTSHS \SPSHPLPP I PPSS PCLL *A 
LWISALLWCWWAETSS 


5884 


42S1 


2522 


GVj-iARASARLRVPLTGVRACAE P E VGAE PAKVAGAAE P DEDGGR 
SRLRDCGDYTPSERLGPK3AMLWFQGAIPAAIATAKRSGAVFVV 
FVAGDDEQSTQMAASWEDDKVTEASSNSFVAIKIDTKSEACLQF 
SQ I YPWCVPS S FF IGDS 31 PLE VIAGS VS ADELVTRI HKVRQM 
HLLKSETSVANGSQSESSVSTPHASFEPmTCENSQSRNAELCE 
IPSTSDTKSDTATGGESAGHATSSQEPSGCSDQRPAEDLNIRVE 
RLTKKI,EERREEKRKEEEQREIKKEIERRKTGKEMLDYKRKQEE 
EI 1 ILE.-RTJ^EKAEDRAARERIKQQIALDRAERAARFAKTKE 
EVEAAKAAALLAKQAEMEVKRESYARERSTVARIQFRLPDGSSF 
TNQFPSDAPLEEARQFAAQTVGNTYGNFSLATMFPRREFTKEDY 
KKKLLDLELAPSASWLL?/ALFINF*AGRPTASIVHSSSGDIW 
TLLGTVLYPFLAIWRLISNFLFSNPPPTQTSVRVTSSEPPNPAS 
SSKSEKREPVRKRVLEKRGDDFKKEGKIYRLRTQDDGEDENNTW 
NGNSTQQM 


58 35 


900 


467 


aagggrrsrlsrswptgpskspsgvrccg\rr\awedkdefldv 
iywfrqi1awlgviwgvlplrgflgiagfclinagvlylyfsn 
ylq 1 deeeyggtweltkegfmts fa/ 1 vhghldhllhchpl * lm 
vyssqvlpiqskgps 


588S 




1341 


PFRGRALTLKKQPRPGVAPPSLGTCHKSDPGRPAAQSQPPSPGS 
3TFGLLSFRMVRTKTWTLKKHFVGYPTNSDFELKTSELPPLKNG 
EVLLEALFLTVDPYMRVAAKRLKEGDTMMGQQVAKWESKNVAL 
PKGT IVLAS PGWTTHSI SDGKDLEKLLTEWPDT I PLSLALGTVG 
MPGLTAYF3LLEICGVKGGETVMVNAAAGAVGSVVGQIAKLKGC 
KWGAVGSDEKVAYLQKLGFDWFNYKTVESLEETLKKASPDGY 
3CYFDNVG3EFSNTVIGQMKKFGRIAICGAISTYNRTGPLPPGP 
PPEIGIYQELRMEAFWYRWQGDARQKALKDLLKWVLELPYFVI 
D*LQANTLVYKSMKSAKPSLEYISEKLVSG\KIQYKEYIIEGFE 
NMPAAFMGMLKGDKLGKTIVKA 


5887 


193 7 


104 


- .P 3C F 3 CRATRCPCRGPRWDSLGDEAARS PAAPGGAPGLLGLRE 
RPDRCHPGGDDRGPQLHRGSPG/ SPSELSRRPGPPGLPGLQGPP 
PAPGLPQSRTL/PVLCVCDLSPAQCDINCCCDPDCSSVDFSVFS 
ACSVPWTGDSQFCSQKAVIYSLNFTANPPQRVFELVDQIKPSI 
FCIHITN\*NLHYPLLIQKYL/NENNFDTLMJCrSDGFTLNAESY 
VSFTTKLDIPTAAKYEYGVPLQTSDSFLRFPSSLTSSLCTDNNP 
AAFLVTCQAVKCTRKINLEQCEEIEALSMAFYSSPEILRVPDSRK 
KVPITVQSIVIQSLNKTLTRREDTDVLQPTLVNAGHFSLCVNW 
LEVKYSLTYTDAGEVTKADLSFVLGTVSSi/WPLQQKFEIHFLQ 
ENTQPVPLSGNPGYWGLPLAAGFQPHKGSGIIQTTNRYGQLTI 
LHSTTEQDCLALEGVRTPVLFGYTMQSGCKLRLTGALPCQLVAQ 
KVKSLLWGQGFPDYVAPFGNSQGP/ADMLDWVPIHFITQSFNRK 
DSCQLPGALVIEVKWTKYGSLLNPQAKIV1TVTANLISSSFPEAN 
SGNERTILISTAVTFVDVSAPAEAGFRAPPAINARLPFNFFFPF 
V 
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SEQ 
IJO: 


Predicted 

beginning 

nucleotide 

location 

corresponding 

to first 

residue of 


Predicted end 
nucleotide 
location 
corresponding 
to first 
amino acid 
residue of 
amino acid 


Ammo acid segment containing signal peptide 
{A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine , G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /=possible nucleotide deletion, ' 
\=possible nucleotide insertion) 


5888 


375 


2302 


LLCRTPGVAMQRADSEQPSKRPRCDDSPRTPSNTPSAEADWSPG 
LELHPDYKTWGPEQVCSFLRRGGFEKPVLLKNIRENEITGALLP 
CLDESRFENLGVSSLGERKRLLSYIQRLVQIHTOTMKVINDPIH 
GHIELHPLLVRIIDTPQFQRLRYIKQLGGGYYVFPGASHNRFEH 
SLGVGYLAGCLVHALGEKQPELQISERDVLCVQIAGLCHDLGHG 
PFSHMFDGRFIPLARPEVKWTHEQGSVMMFEHLINSNGIKPVME 
QYGLIPEEDICFIKEQIVGPLESPVEDSLWPYKGRPENKSFLYE 
IVSNKRNGIDVDKWDYFARDCHHLGIQNNFDYKRFIKFARVCEV 
DNELRICARDKEVGNLYDMFHTRNSLHRRAYQHKVGNIIDTMIT 
DAFLKADDYIEITGAGGKKYRISTAIDDMEAYTKLTDNIFLEIL 
YSTDPKLKDAREILKQIEYRNLFKYVGETQPTGQIKIKREDYES 
LPKEVASAKPKVLLDVKLKAEDFIVDVTNMDYGMQEKNPIDHVS 
FYCKTAPNRAIRITKNQVSQLLP\EKFAEQ\LIRVYCKKVDRKS 
LYA\ARQYFVQW\CADR\NFT\KPQDGRCY*PPTP*HPQKKGW\ 
NDSTFSPKIPTRLPRRLPKSRV\QLFKDDPM 


5889 


1831 


731 


LPAACGRPVTARPRQAPEGRSGRPRDLDPYPPQVFPPRPDRVAI 
VTGGTDGIGYS TAKHLARLGMHVI IAGNNDS KAKQWSK1 KEET 
LNDKET*VLLCCPGWLCLWNSSDPPTSASRGAGTTGVHHHFLLK 
FGIFIL\DIASKTSIRQFVQKFKMKKI?IiHVLIlsrNAGVMMVPQR 
KTRDGFEEHFGLNYLGHFLLTNLLLDTLKESGSPGHSARWTVS 
SATHYVAELNMDDLQSSACYSPHAAYAQSKLALVLFTYHLQRLL 
AAEGSHVTANWDPRVVNTDT.YKHVFWATRLAKKLLGWLLFKTP 
DEGAWTSIYAAVTPELEGVGGRYLYNKKETKSLHVTYNQKLQQQ 
LWSKSCEMTGVLDVTL 


5890 


13 2 2 


200 


FRRGWSAAGRAVPVAFCSRI3ASSPRRPRGAVRLQSGTEAACRS 
GRPDPRPASAAGGHAGERMSQRDTLVHLFAGGCGGTVGAILTCP 
LEWKTRLQSSSVTLYISEVQLNTMAGASVNRVVSPGPLHCLKV 
ILEKEGPRSLFRGLGPNLVGVAPSRAIYFAAYSNCKEKLNDVFD 
PDS TQVHMI SAAMAGFTAI TATNPI WL I KTRLQL* / SQGTAGKR 
RMGAFECVRKVYQTDGLKGFYRGMSASYAGISETVIHFVIYESI 
KQKLLEYKTASTMENDEESVKEASDFVGMMLAAATSK\LVATTI 
AYPHEWRTRLREEGTKYRSFFQTLSLLVQEEGYGSLYRGLTTH 
LVRQ I P \NTAIMMATYELWYLLNG 


58S1 


1322 


200 


FRRGWSAAGRAVPVAFCSR I SAS SPRRPRGAVRLQSGTEAACRS 
GRPDPRPASAAGGHAGERMSQRDTLVHLFAGGCGGTVGAILTCP 
LEWKTRLQSSEVTLYISEVQLNTMAGASVNRWSPGPLHCLKV 
ILEXEGPRSLFRGLGPNLVGVAPSRAIYFAAYSNCKEKLNDVFD 
PDSTQVHMISAAMAGFTAITATNPIWLI KTRLQL* /SQGTAGKR 
RMGAFECVRKVYQTDGLKGFYRGMSASYAGISETVIHFVIYESI 
KQKLLEYKTASTMENDEESVKEASDFVGMMLAAATSK\LVATTI 
AYPHEWRTRLREEGTKYRSFFQTLSLLVQEEGYGSLYRGLTTH 
LVRQIP\NTAIMMATYELWYLLNG 


5892 


1764 


" 


WLRVCGRLSVNSAVSSRTGGWSAGLTCAMQRLQWLGHLRGPA 
DSGWMPQAAPCLSGAPHASAADWWHGRRTAICRAGRGGFKDT 
TPDELLSAVMTAVLKDVNLRPEQLGDICVGNVLQPGAGAIMARI 
AQFLSEIPETVPLSTVMRQCSSGLQAVAS-AGGIRNGSYDIGMA 
CGVESMSLADRGNPGNITSRLMEKEKARDCLIPMGITSENVAER 
FGISREKQDTFALASQQKAARAQSKGCFQAEIVPVTTTVHDDKG 
TKRSITVTQDEGIRPSTTMEGLAKLKPAFKKDGSTTAGNSSQVS 
DGAAAILLARRSKAEELGLPILGVLRSYAWGVPPDIMGIGPAY 
AI PVALQKAGLTVS DVD IFEINE \AFASQAAYC VEKLRL P P * EG 
+ TPLGGASGP*GHPLGLHWGHVQVITLAQ*S*SARGKRAYRSGC 
P C A I G S WNG S P LP VFE YPWGT 


5893 


3 


1653 


I - I fc JI/i^KVa 1 IGAGVSGLISLXCCVDEGLEPT 
CFERTEDIGGVWRFKENVEDGRASIYQSWTNTSKEMSCFSDFP 
MPEDFPNFLHNSKLLEYFRIFAKKFDLLKYIQFQTTVLSVRKCP 
DFSSSGQWKWTQSNGKEQSAVFDAVMVCSGHHILPHIPLKSFP 
GMERJKGQYFHSRQYKHPDGFEGKRILVIGMGNLGSDIAVELSK 
NAAQVFISTRHGTWVMSRISEDGYPWDSVFHTRFRSMLRNVLPR 
TAVKWMIEQQMNRWFNHENYGLEPQNKYIMKEPVLNDDVPSRLL 
CGAIXVKSTVKELTETSAIFEDGTVEENIDVIIFATGYSFSFPF 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 
amino acid 
residue of 
amino acid 


Predicted end 
nucleotide 

corresponding 
to first 
amino acid 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidins, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine , W=Asparagine, 
P=Proline, Q=Glutamine, E=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








LEDSLVKVENNMVSLYKYIFPAHLDKSTLACIGLIQPLGSIFPT 
AELQARWVTRVFKGLCSLPSERTMMMDIIKRNEKRIDLFGESQS 
QTLQTNYVDyLDELALEIGAKPDFCSLLFKDPKLAVRLYFGECN 
SY*YRLVGPGQWEGARNAIFTQKQRILKPLKTRALKDSSNFSVS 
FLLKI LGLLAWVAFF\ CQLQWS 


5894 


174 


1673 


RYSPKKVLQNKESSLKLGMATALVSAHSLAPLNLKKEGLRWRE 
DHYSTWEQGFKLCGNSKGLGQEPLCKQFRQLRYEETTGPREALS 
RLRELCQQWLQPETHTKEHILELLVLEQFLIILPKELQARVQEH 
HPESREDVWJLEDLQLDLGETGQQVDPDQPKKQKILVESMAPL 
KGVQEQQVRHE CEVTKPEKEKGEETRI ENGKL IWTDS CGRVES 
SGKISEPMEAHNEGSNLERHQAKPKEKIEYKCSEREQRFIQHLD 
LIEHASTHTGKKLCESDVCQSSSLTGHKK.VLS*ERKVIQC\HGV 
LGKAFQRSSHLVRHQKIHLGEKPYQCNECGKVFSQNAGLLEHLR 
IHTGE KP YLC IHCGKNFRRSSHLNRHQR I HSQEEPCE CKECGKT 
FSQALLLTHHQRIHSHSKSHQCNECGKAFSLTSDLIRHHRIHTG 
EKPFKGNICQKAFRLMSHIiAQHVRIHNrEEKPYQCSECGEAFRQR 
SGLFQHQRYHHKDKLA 


5895 


2967 


86 


hpsllgaipfypppsspwppplylfwnshrksrhfinqrgihge 
mrlfv3dgvpqclpvlaaaqrar3ra3vlistvgpedcwpflt 
rpkvpvlqiidsgnylfstsaicryff\llsgweqddltnqwlew 
eatelqptlsaalyyl\wqgkkg\edvlgsvrrtlthidhsls 
rq\ncpflagetesladivlwgalypllqdpaylpeelsalhsw 
fqtlstq\epcqr\aarrlvlkq\qgvialr\pylqkqpqpspa 
egkglspiepeeeelatlseeeiamavtawekgleslpplrpqq 
npvlpvagernvlitsalpyvnnvphlgniigcvlsadvfarys 
rlrqwntlylcgtdeygtatetkal\eegltpqeicdkyhiiea 
diy\rwfn1sfdifgrtttpqq\tkit\qdifqqllkrgfvlqd 
tveqlrcehcarf\ladrfvegvcpfcgyeeargdqcdkcgkli 
navelxkpqckvcrscpwqssqhlfldlpklekrleewlgrtl 
pgsdwtpnaqfitpffgfrewpskprnq*trdlk\wgnpgtp*e 
gfedk\ vfyvwfdat i gylsitanytdqwerww \ knpeqvdlyq 
fm\akdnvpfhslvfpssalgaednytl\vshliateylnyedg 
k\fsksrgvgvfrdm\ahdtgippdisrfyl\lyirpegk\dsa 
fswtdlllknns\ellnnlgnfinra\gmfvskffgg\yvpemv 
ltpddqrlla\hvtlelqhyhq\llekvrirdalrsiltis\rh 
3nqy i \ q vnepw\ kr i kgseadrqragtvtglavnl aallsvml 
qpymptvsatiqaqlqlpppacsilltnflctlpaghqigtvsp 
lfqklendqieslrqrfgggoaktspkpawetvttakpqqiqa 
lmdevtkqgnivrelkaqkadknevaaevaktjldlkkqlavaeg 
kppeapkgkkkk 


5895 


2967 


86 


HPSLLGAIPFYPPPSSPWPPPLYLFWNSHRKSRHFINQRGIHGE 
MRLFVSDGVPGCLPVLAAAGRARGRAEVLISTVGPEDCVVPFLT 
RPKVPVLQLDSGNYLFSTSAICRYFF\LLSGKEQDDLTNQWLEW 
SATELQPTLSAALYYL\WQGKKG\EDVL3SVRRTLTHIDHSLS 
RQ\NCPFLAGETESLADIVLWGALYPLLQDPAYLPEELSALHSW 
FQTLSTQ\EPCQR\AARRLVLKQ\QGVLALR\PYLQKQPQPSPA 
EGKGLSPIEPEEE2IATLSEE2IAMAVTAWEKGLESLPPLRPQQ 
KPVLPVAGERNVLITSALPYVNNVPHLGNIIGCVLSADVFARYS 
RLRQVIIITLYLCGTDEYGTATETKAL\EEGLTPQEICDKYHIIHA 
DIY\RWFNISFDIFGRTTTPQQ\TKIT\QDIFQQLLKRGFVLQD 
TVEQLRCEHCARF\LADRFVEGVCPFCGYEEARGDQCDKCGKLI 
J..V..ILKKP2CKVCRSCPWQSS0HLFLDLPKLEKRLEEWLGRTL 
PGSDWTPIS1AQFITPFFGFREW?SKPRWQ*TRELK\WGNPGTP*E 
G FEDK \ VF Y VW FDAT IG YLS I TANYTDQWERWW \ KNPEQVDLYQ 
FM \ AKDNVP FKSLVFPS S ALGAEDN YTL \ VSHL I ATE YLN YEDG 
K\FSKSRGVGVFRDM\AHDTGIPPDISRFYL\LYIRPEGK\DSA 
FSWTDLLIiKNKS\ELLNNLGNFINRA\GMFVSKFFGG\YVPEMV 
LTPDDQRLLA\HVTLELQHYHQ\LLEKVRIRDALRS I LT I S \RH 
GNQYI\QVNEPW\KRIKGSEADRQRAGTVTGLAVNIAALLSVI1L 
QPYMPTVSAT1QAQLQLPPPACSILLTNFLCTLPAGHQIGTVSP 
L FQKLEWDQ I E S 1RQRFGGGQAKTS P KPAWET VTTAKPQQ I QA 
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SEQ 
ID 
NO: 


Predicted 
beginning 
nucleotide 

corresponding 
to first 
amino acid 
residue of 
amino acid 
sequence 


Predicted end 
nucleotide 

cor re sponding 
to first 

residue of 

sequence 


Amino acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine , K=Lysine, 
Ii=Leucine, M=Methionine , N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








LKDEVTKQGNIVRELKAQKADKNEVAAEVAKLLDLKKQLAVAEG 
KPPEAPKGKKKK 


5897 


29S7 


86 


HPSLLGAIPFYPPPSSPWPPPLYLFWNSHRKSRHFINQRGIHGE 
MRLFVSDGVPGCLPVLAAAGRARGRAEVLISTVGPEDCWPFLT 
RPKVPVLQLDSGNYLFSTSAICRYFF\LLSGWEQDDLTNQWLEW 
EATELQPTLSAALYYlA WQGKKG \EDVLGSVRRTLTHI DHS LS 

rq\ncpflagetesladivlwgalypllqdpaylpeelsalhsw 
fqtlstq\epgqr\aarrlvlkq\qgvialr\pylqkqpqpspa 
egkglspiepeeeelatlseeeiamavtawekgleslpplrpqq 
npvlpvagerwlitsalpyvnnvphlgniigcvlsadvfarys 
rlrqwntlylcgtdeygtatetkal\eegltpqeicdkyhiiha 
diy\rwfnisfdifgrtttpqq\tkit\qdifqqllkrgfvlqd 
tveqiircehcarf\ladrfvegvcpfcgyeeargdqcdkcgkli 
navelkkpqckvcrscpwossohlfldlpklekrleewlgrtl 
pgsdwtpnaqfitpffgfrewpskprwq*trdlk\wgwpgtp*e ' 
gfedk\vfyvwfdatigylsitamytoqwerww\knpeqvdlyq 

FM\AKDNVPFHSLVFPSSALGAEDNYTL\VSHLIATEYIjNYEDG 
K\FSKSRGVGVFRDM\AHDTGIPPDISRFYL\LYIRPEGK\DSA 
FSWTDLLLKNNS \ ELLNNLGNF INRA\GMFVS KFFGG \ YVPEMV 
LTPDDQRLLA\ HVTLELQHYKQ \ LLEKVR I RDALRS I LT I S \ RH 
GNQYI\QWEPW\KRIKGSEADRQRAGTVTGLAVNIAALLSVML 
QPYMPTVSATIQAQLQLPPPACSILLTNFLCTLPAGHQJGTVSP 
LFQKLENDOIESLRQRFGGGQAKTSPKPAWETVTTAKPQQIQA 
LMDEVTKQGN I VR E LKAQKAD KNE VAAE VAKLLDLKKQLAVAEG 
KPPEAPKGKKKK 


5898 


2967 


86 


HPSLLGAIPFYPPPSSPWPPPLYLFWNSHRKSRHFINQRCIIIGE 
MRLFVSDGVPGCLPVLAAAGRARGRAEVLISTVGPEDCWPFLT 
RPKVPVLQLDSGNYLFSTSAICRYFF\LLSGWEQDDLTNQWLEW 
EATELQPTLSAALYYL\WQGKKG\EDVLGSVRRTLTHIDHSLS 
RQ\NCPFLAGETESLADIVLKGALYPLI)QDPAYLPEELSALHSW 
FQTLSTQ\EPCQR\AARRLVLKQ\QGVLALR\PYLQKQPQPSPA 
EGKGLSPIEPEEEELATL3EEEIAMAVTAWEKGLESLPPLRPQQ 
NPVLPVAGERNVLITSALPYVNNVPHLGNIIGCVLSADVFARYS 
RLRQWNTLYLCGTDEYGTATETKAL\EEGLTPQEICDKYHIIHA 
DIY\RWFNISFDIFGRTTTPQQ\TKIT\QDIFQQLLKRGFVLQD 
TVEQLRCEHCARF\LADRFVEGVCPFCGYEEARGDQCDKCGKLI 
NAVELKKPQCXVCRSCPWQSSQHLFLDLPKLEKRLEEWLGRTL 
PGSDWTPNAQFITPFFGFREWPSKPRWQ*TRDLK\HGNPGTP*E 
GFEDK\VFYWFEATIGYLSITANYTDQWERWW\KNPEQVDI,YQ 
FM\AKDNVPFHSLVFPSSALGAEDNYTL\VSHLIATEYLNYEDG 
K\FSKSRGVGVFRDM\AHDTGIPPDISRFYL\LYIRPEGK\DSA 
FSWTDLLLKWNS\ELLKNLGNFINRA\GMFVSKFFGG\YVPEMV 
LTPDD2RLLA\HVTLELQHYHQ\LLEKVRIRDALRSILTIS\RH 
GNQYI\Q\-TJEPW\KRIKGSEADRQRAGTVTGLAVNIAALLSVML 
QPYMPTVSATIQAQLQLPPPACSILLTNFLCTLPAGEQIGTVSP 
LFQIOjENDQIESLRQRFGGGQAKTSPKPAWETVTTAKPQQIQA 
LMDEVTKQGKIVRELKAQKAEKWEVAAEVAKLLDLKKQLAVAEG 
KPPEAPKGKKKK 




326 


1078 


KCPKSKEPNGVRAPSLPSPLRAAMALSDVDVKKQIKHMMAFIEQ 
EAIIEKAEEIDAKAEEEFNIEKGRLVQTQRLKIMEYYEKKEKQIE 
QQKKILMSTMRNQARLKVLRARNDLISDLLSEAKLRLSRIVEDP 
EVYQGLLDKLVLOGLLRLLEPVMIVRCRP\ODLLLVEAAVQKAI 
PEYMTISQKHVEV\QIDKEA*LAVECSWEVWEVYSGNQRIKVSN 
TLESRLDLSAKQKMPEIRMALFGANTNRKFFI 


5900 


64 


1409 


KAASRDSPCLEFCPLCGVSSHDLQHRMWYHRLSHLHSRLQDLLK 
GGVIYPALPQPNFKSLLPLAVHWHHTASKSLTCAWQQHEDHFEL 
KYANT VMRFD YVWLRDHCRSAS C YNS XTHQRS LDTAS VDLC I KP 
KTIRLDETTLFFTWPDGHVTKYDLNWLVKNSYEGQKQKVIQPRI 
LWNAE I YQQAQVPSVDCQS FLETNEGLKKFLQNFLLYGIAFVEN 
VPPTQEHTEKIAERISLIRETIYGRMWYFTSDFSRGDTAYTKLA 
LDRHTDTTYFQEPCGIQVFHCLKHEGTGGRTLLVDGFYAAEQVL 
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SEQ 
ID 

NO: 


Predicted 
beginning 
nucleotide 

corre sponding 
to first 
amino acid 

amino acid 


Predicted end 

corresponding 
to first 
amino acid 
residue of 

sequence 


Ammo acid segment containing signal peptide 
(A=Alanine, C=Cysteine, D=Aspartic Acid, E= 
Glutamic Acid, F=Phenylalanine, G=Glycine, 
H=Histidine, I=Isoleucine, K=Lysine, 
L=Leucine, M=Methionine, N=Asparagine, 
P=Proline, Q=Glutamine, R=Arginine, 
S=Serine, T=Threonine, V=Valine, 
W=Tryptophan, Y=Tyrosine, X=Unknown, *=Stop 
Codon, /^possible nucleotide deletion, 
\=possible nucleotide insertion) 








QKAP^E FELLS KSAI \KHEYIEDVGECHQPHDWDWAQS* ISTHG 
/YKELYLIRYNNYDRAVINTVPYDWHRWYTAHRTLTIELRRPE 
NEFWVKLKPGRVLFIDNWRVLHGRECFTGYRQLCGCYLTRDDVL 
NTARLLGLQA 


B901 


1 


2121 


VAI3CTSLKMMQAVGGAPARPTGEYICKQCGAKYTSLDSFQTHL 
KTHLETVLPKLTCPQCNKEFPNQESLLKHVTIHFMITSTYYICE 
SCDKCFTSVDDLOKHLLDMHTFVFFRCTLCQEVFnSKVSIQLHL 
\AVKHSNEKKVYRCTSCMWDFRNETDLQLHVKHNHLEN£3GKVHK 
CIFCGESFGTEVELQCHITTHSKKYNCKFCSKAFHAIILLEKHL 
REKHCVFETKTPNCGTNGASEQVQKEEVELQTLLTNSQESHNSH 
DGSEEDVDTSEPMYGCDICGAAYTMETLLONHQLRDHNIRPGES 
AIVKKKAELIKGNYKCNVCSRTFFSENGLREHMQTHLGPVKEYM 
CPICGERFPSLLTLTEHKVTHSKSLDTGNCRICKMPLQSEEEFL 
EHCQMHPDLRNSLTGFRCWCMQTVTSTLELKIHGTFHMQKTGN 
GSAVQTTGRGQHVQKLYKCASCLKEFRSKQDLVKLDINGLPYGL 
CAG CVNLSKSASPG INVPPGTNR PGLGQNENLSA 1 EGKGKVGGL 
KTRCS*LATFKF*VLKVELPEPH?KPFHRGVSRPDSNSTQLKTP 
QVSPMPRISPSQSDEKKTYQCIKCQMVFYNEWDIQVHVANHKID 
EGLNHECKLCSQTFDSPAKLQCHLIEHSFEGMGGTFKCPVCFTV 
FVQANKLQQHIFSAHGQEDKIYDCTQCPQKFFFQTELQNHTMTQ 
HSS 


5902 


712 


209 


JJ PPSRPSIRCSIGSTSV£RWLTSLFTYLUHTA1JVU*V*RKF 
I PLKPRQ* ED * MFQSWLHAWGDTLEEAFEQCAMAMFG YMTDTGT 
VEPLQTVEVETQGDDLQSLLFHFLDEWLYKFSADEFFIP\GWGE 
EFSLSKKPQGTEVKAITYSAMQVYNEENPEVFVIIDI 


5903 


210S 


735 


DTPGPSLPSTTAPFSLRSLSFPSRPSYLLPGDPQPLQGRGLPTT 
PALFALSAVPGGAASPMPPSGLRLLPLLLPLLWLLVLTPGRPAA 
GLSTCKTIDMELVIOIKRIEAIRGQILSKLRLASPPSQGEVPPGP 
LPEAVLALYNSTRDRVAGESAEPEPEPEADYYAKEVTRVLMVET 
HNEIYDKFKQSTHSIYMFFNTSELREAVPEPVLLSRAELRLLRL 
KLKVEQHVELYQKYSNNSWRYLSWRLLAPSDSPEWLSFDVTGW 
RQWLSRGGEIEGFRLSAHCSCDSRDNTLQVDINGFTTGR\RGDL 
ATIHGMNRPFLLLMATPLERAQHLQS\SRHRQAL\DTNY\CFSF 
KGGRNCLRC /VHC*HL 1 FRKDL\GW \ XWI \HE \ P KGYHANFC\L 
GPCPY I WE LDTQYSKVLALYNQ\HKPG\ASAAP \ CCVPQALEP \ 
LPIVYY\VGRKPKVEQLSNMIVRSCKCS 


5904 


3 


1126 


MMEEIENAINTFKEEQRLIYEELIKEEKTTMWELSAISRKIDTW 
ALGNS3TE KAFKAI S S KVPVDKVTPS TLPEEVLDFEKFLQQTGG 
RQGAWDDYDHQNFVKVRNKHKGKPTFMEEVLEHLPGKTQDEVQQ 
HEKW* QKF LALEERKKES IQIWKTKKQQKREE I FXLKEKADNTP 
vLFEJKQEDNQKQKEEQRKKQKLAVEAWKKQKSIEMSMKCASQL 
KEEEEKEKKHQKERQRQFKLKLLLESYTQQKKEQEEFLRLEKEI 
REHZiFXAEKRKNAADEISRFQERDLHKLELKlLDRQAKEDEKSQ 
KQRRLAKLKEKVENNVSRDPSRLY/NTHQRLGRTNQKDRTNRLW 
ATSTYPT*GYSNLETRNTEKSMR 


5905 


287 




m. s fp prvnekeivrlrtrgellapaapfdkkcgrenwtvafap 
dg5y7awsqghrtvklvpwsqclqnfllhgtknvtnssslrlpr 
2ns2ggqknkprehiidcgdivwslafgssvpekqsrcvniewh 
rfrfgqdqlllatglnsgrikiwdvytgklllnlvdhtgwrdl 
tfapdgslilvsasrdktlrvwdlrddgn\mmkvlrghqnwvy\ 
scafspdssmlcsvgaskawamlv*lrlcwhhsht3atmvls 
waervasi*tglgatftig*sntlafvlqgvlyvhrcwsmstfcf 
sfflffffkvisptvkyh*llsklifqfygigsltsetnlm*si 
wlsngfsvlffgilsdsrdilrl*fnlkfvliff*k*civsvqk 
kkkpkriallqeerls*dkppssiili*qtevnir:lfrailhs* 
ll i frx * nc i * tys * 1 1 dp fy i qmt ydrg* fgknkmvkf * f i em 
*lyyfhkiafsfcnw*hpcclpkkfhlavnilfacsicfss*a 
qvgdpsll*tsdylkgrcqwsnnlltlrflsvyffknlwsgkk 
reggl*yltlfisvyfs*lvfgingfqysfwklhclyfmfrli 
fkltfnrni*nricmsalinlktdfkltmtlsiffklliiyna* 
ynln*i*qf*ykmchfvlcmse*syniclfiagf\lwnmdkytm 



398 



